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These lectures describe the use of effective field theories to extrapolate results 
from the parameter region where numerical simulations of lattice QCD are possi- 
ble to the physical parameters (physical quark masses, infinite volume, vanishing 
lattice spacing, etc.). After a brief introduction and overview, I discuss three top- 
ics: 1) Chiral perturbation theory (XPT) in the continuum; 2) The inclusion of 
discretization effects into XPT, focusing on the application to Wilson and twisted- 
mass lattice fermions; 3) Extending XPT to describe partially quenched QCD. 



1. Overview and Aims 

More than 30 years after Wilson introduced lattice QCD 1 , and more than 
25 years after Creutz's pioneering numerical studies of non-abelian gauge 
theories 2 , we can now simulate lattice QCD, including quarks, with param- 
eters that approach their physical values. This is the result not only of 
advances in computer power but also of improvements in algorithms and 
actions. 51 In particular, we can simulate QCD with pion masses of 250 MeV 
or lower, with the minimum value depending on the choice of fermion ac- 
tion. Such masses should allow a controlled extrapolation to the physical 
pion masses, one that can give errors at the few percent level 3 . One of the 
aims of the field is to provide results with this accuracy for many hadronic 
quantities, allowing both tests of the method and predictions for unmea- 
sured quantities. This goal has begun to be attained 4 . 

Despite the successes just outlined, it is important to keep in mind the 



* Lectures given at ILFTN Workshop on "Perspectives in Lattice QCD", Nara, Japan, 
Oct 31-Nov 11, 2005. 

"Some of these improvements are reviewed in Tony Kennedy's lectures. 
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limitations of LQCD (lattice QCD). Simulations are, and will remain for 
the foreseeable future, limited in scope — one or two particle states in a box 
unlikely to exceed L = 3 — 5 fm, with lattice spacings unlikely to be smaller 
than a w 0.05 fm, and pion masses unlikely to drop below 200 MeV. b In 
order to connect these results to those for physical quark masses in the 
continuum and infinite volume limits one needs a quantitative theoretical 
understanding of how to extrapolate. Such an understanding can be pro- 
vided by chiral perturbation theory generalized to include discretization 
errors, and is the topic of these lectures. 

One way to think of this situation is that LQCD is a powerful tool with 
several adjustable parameters ("knobs"). While we are able to turn these 
knobs independently (unlike in the physical world where they are fixed), we 
cannot turn them to their physical values. Thus we are stuck simulating 
theories with unphysical values of the parameters, and we need additional 
theoretical input. 

In fact, there are several other knobs (beyond quark masses, a, L and 
«em) that we can adjust independently. We can use different sea and va- 
lence quark masses — giving partially quenched (PQ) theories — or we can 
go further and use different actions for valence and sea quarks — "mixed 
action" simulations. An interesting example of the latter is to use va- 
lence fermions with good chiral symmetry (Domain- Wall or Overlap) and 
cheaper sea quarks (staggered or Wilson- like) . Both PQ and mixed action 
theories are "really" unphysical: they not only have unphysical values of 
the parameters but they are also not unitary. Nevertheless, they are well- 
defined Euclidean statistical systems, with long-distance correlations, and 
it is plausible that they can be described by an effective chiral theory. Fur- 
thermore, in both cases there are points in parameter space for which the 
theories are physical, which "anchor" the effective theories. I will discuss 
this in detail for PQQCD in sec. 4, and for now only illustrate the situa- 
tion with Fig. 1. The aim is to use the freedom provided by having extra 
knobs, which are relatively cheap to turn, in order to improve the accuracy 
of the extrapolation to the physical point: "physical results from unphysi- 
cal simulations" 6 . This is an essential feature of the MILC collaboration's 
work on decay constants and quark masses 3 . 



b There are important exceptions, such as the very small lattice spacings used to match 
QCD with a b quark onto heavy quark effective theory"", which are possible because L 
can also be reduced. 

c For percent accuracy one must also account for the effects of electromagnctism. 
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A different example of unphysical theories is the use of "rooted" stag- 
gered fermions. Each staggered flavor leads to four degenerate "tastes" in 
the continuum limit, and the standard approach to obtain a single contin- 
uum fcrmion per flavor is to take the fourth root of the fermion determinant. 
As the taste symmetry is broken for a^O, this rooting leads, however, to 
a non-local single-flavor fermion action on the lattice . The implications of 
a non-locality that formally vanishes as a — > are controversial — does one 
remain in the universality class of QCD? I will not discuss this issue here, 
but only note that the theory at a ^ is undoubtedly unphysical, and the 
chiral-continuum extrapolation can only be done if one has an effective the- 
ory which describes the unphysical features. This is provided by "(rooted) 
staggered xPT" 8 ' 9 ' 10 . Whatever the outcome of the rooting controversy, 
this is another example of using effective field theory to obtain physical 
results from unphysical simulations. 

Due to limitations of time and space, I will discuss only a subset of 
applications of XPT to LQCD in these lectures. I begin with a brief in- 
troduction to XPT in the continuum, with an emphasis on lessons for the 
lattice. I follow that with the example of incorporating discretization errors 
into XPT for twisted-mass fermions, which includes Wilson and improved 
Wilson fermions as a subset. In this case the theory is physical, but gives 
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a nice example of the power of adding an extra knob (the twist angle) and 
of the utility of XPT. Finally, I discuss PQXPT, i.e. chiral perturbation 
theory for PQQCD. 

I will mainly focus on the theoretical set-up and on general issues of the 
applicability of XPT. I will not provide a review of the status and accuracy 
of the state-of-the-art extrapolations. My hope is that this introduction to 
the tools will allow the reader to critically assess current work. 

2. Review of %PT in the continuum 

In this section I describe the construction of the chiral Lagrangian in the 
continuum. There are many good books and lectures on this topic. I 
have found those by Donoghuc, Golowich & Holstein 11 , Ecker 12 , Georgi 13 , 
Kaplan 14 , Kronfeld 15 , Manohar 16 and Pich 17 very useful. 

2.1. Effective Field Theories in general 

In these lectures I consider two examples of effective field theories (EFTs) : 
XPT as an EFT for QCD, and Symanzik's effective continuum theory for 
lattice QCD (the latter to be discussed in sec. 3.4). Thus it is useful to 
begin with a discussion of EFTs in general. If you are unfamiliar with the 
subject then some of this section may be hard to follow in detail, but my 
aim is to begin with a broad-brush sketch, which will be filled in later. 

The generic situation is that we have an underlying theory in which 
there is a separation of scales. In the theories of interest we have: 

XPT: ^ ~ ^m p ,m N ; (1) 

Symanzik: p qua rk , Pgiuon ~ A QC d < n/a . (2) 

Note that in the former case there is a separation of masses, with the 
"pions" (by which I mean the light pseudoscalars: ir, K and 77) being lighter 
than all other hadrons, while in Symanzik's theory we choose to consider 
momenta much smaller than the lattice cut-off. In both cases there is a 
good reason to split off the low-scale physics. For XPT it is because the 
pion sector changes most rapidly as we approach the chiral limit (as we will 
see in detail). For Symanzik's theory we want to understand the impact of 
lattice spacing errors on the quarks and gluons which dominate the non- 
perturbative contributions to hadronic quantities. 

Crudely speaking, we now introduce a momentum cut-off A lying be- 
tween the two scales, and "integrate out" the high-momentum degrees of 
freedom. This process is illustrated schematically in Fig. 2. It leaves only 
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pions with low momenta in XPT, and continuum-like quarks and gluons 
in the Symanzik theory. These degrees of freedom interact via vertices 
that are quasi- local, with a physical size A -1 . This quasi-locality follows 
because we consider only external momenta satisfying p <C A, so the high- 
momentum degrees of freedom are always highly virtual. The vertices are 
then expanded in powers of p/A, yielding local operators with increasing 
numbers of derivatives. The low-momentum modes themselves can become 
nearly on shell (or exactly on shell if we continue to Minkowski space) , but 
the resulting analytic structure of correlation functions (leading to cuts in 
Minkowski space) is maintained in the effective theory. 




Figure 2. Generation of effective field theories. 



This description is impractical to implement in most cases. In particu- 
lar, we do not know how to integrate out quarks and gluons from QCD an- 
alytically to yield a theory of pions, since confinement is a non-perturbative 
phenomenon. Even in the Symanzik theory, where one might have expected 
that quarks and gluons with p ~ n/a would have been perturbative since 
1/a > 2 GeV (a < 0.1 fm), it turns out that accurate results mostly require 
non-perturbative calculations. d The beauty of the EFT method, however, 
is that we do not actually need to do the integrations. Instead, following 
Weinberg 18 we can rely on the general properties of EFTs. If the underly- 



d This is found when implementing the improvement program for Wilson fermions 
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ing theory is physical, its S-matrix will be unitary, Lorentz-invariant, satisfy 
cluster decomposition, and transform appropriately under the internal sym- 
metries. These properties must be maintained by the EFT, which is, after 
all, designed to reproduce the S-matrix of the low-momentum degrees of 
freedom. The only known way to do this is with a local, Lorentz-invariant 
Lagrangian. It should be constructed solely from the low-energy degrees of 
freedom, and satisfy the same internal symmetries as the underlying theory. 
All possible terms consistent with these symmetries must be included — this 
precludes the need to explicitly integrate-out degrees of freedom, at the 
price of introducing unknown constants. 

One notable feature of the resulting £ e g is that it is not rcnormalizable, 
and thus valid only over a limited energy range. This is an intrinsic part of 
the construction: we know that the EFT breaks down when p > A. Non- 
renormalizability does not, however, imply a lack of calculability. As we 
will see, one can expand quantities in powers of p/A, with a finite number 
of unknown coefficients at each order. The limitations of the method are 
then (a) the need to introduce unknown coefficients and (b) an unavoidable 
truncation error. This error, however, decreases as the separation in scales 
increases (i.e. as -^Oora^O). 

As just described, the justification of EFT is based on properties of 
the S-matrix, and thus rooted in Minkowski space. While this is fine for 
the development of continuum XPT (my first topic), the natural objects in 
lattice simulations (my second topic) are Euclidean (finite-volume) corre- 
lation functions of local operators. In particular, the discretization errors 
are constrained by the symmetries of a Euclidean lattice. Thus an alter- 
native approach to developing and justifying an EFT is needed. This has 
been provided for the case at hand by Symanik 20 , using an extension of 
renormalization theory. The result (established to all orders in perturba- 
tion theory) is that the recipe given above still applies: keep all local terms 
consistent with the symmetries of the underlying theory (in this case the 
discrete symmetries of a Euclidean lattice). My third topic, PQQCD, is 
also strictly limited to Euclidean space, but here neither of the previous 
justifications apply, and one must make further assumptions. 

Because my second and third topics involve Euclidean theories, I have 
chosen to couch the discussion of the first (XPT) also in Euclidean space. 
This allows later sections to build on the earlier notation. In fact, it is 
perfectly legitimate, having determined the Minkowski-invariant local ef- 
fective Lagrangian, to rotate this to Euclidean space. The result will be 
the most general Euclidean-invariant local Lagrangian (consistent with the 
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other symmetries, which are unaffected by the rotation). This Lagrangian 
will reside in the functional integral which generates the Euclidean correla- 
tion functions of the theory. 

2.2. Chiral symmetry in QCD and its breaking 

Without further ado, let me turn to the first concrete example, XPT. The 
fermionic part of Euclidean Lagrangian for QCD is given by 

£qcd = QlPQl + QrPQr + Q L MQ R + Q R M^Q L . (3) 

where I have included only the N = 3 light quarks, Q tr — (u,d,s), Q = 
(u, d,s). I will also consider the N = 2 theory without the strange quark. 
Left- and right-handed fields are defined with projectors P± = (1 ±75)/2, 
and are Ql,r = P^Ql,r and Q L R — Ql.rP±- There is no problem with 
Q L and Ql being defined with the different projectors, since Q and Q are 
independent fields. The key fact is that, in the massless limit, left- and 
right-handed quarks can be rotated independently, so the Lagrangian has 
a G — SU(3)l x SU(3)r chiral symmetry under which 

Ql.r. > Ul rQl r , Q l .r^Q l ,rUI r : U L ,ReSU{3) L ,R. (4) 

There is also the overall vector U (1) symmetry, which counts quark number, 
while the apparent axial U(l) symmetry is broken by the anomaly. 

Quark masses enter through the mass matrix M, which is conventionally 
taken to be M = diag(m u , m^, m s ) = M\ These masses break the chiral 
symmetry: any non-zero values violate the axial symmetries (those with 
Ul — U R ), while the vector symmetries (Ul — Ur) are broken unless 
the masses are degenerate. We can, however, formally retain the chiral 
symmetry of Lqc d by treating M as a complex "spurion field" transforming 
as M — > U L MU R and M f — > UrM^Ul. This is a convenient trick for 
keeping track of the symmetry-breaking caused by the mass term. 

Since chiral symmetry is key to all that follows, and quark masses break 
this symmetry, we must require that M be small. What does small mean? 
One criterion is that M should be small compared to the QCD scale, m q <C 
Aqcd ~ 300 MeV. A more precise criterion will arise from xPT: m n! K,ri 
A x ee iirfir rj f200 MeV. ft follows that in physical QCD, with (m u + 
nid)/2 fa 4 MeV, SU(2)l x SU(2)r is a very good approximate symmetry, 
while SU(3)l x SU(3)r is more badly broken since m s w 100 MeV and 
rriK.q ~ A x /2. This brings up an important question for lattice applications 
of XPT: can approximate chiral symmetry can be used to determine the 
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strange quark mass dependence when mjf* w m s l If not, then we can only- 
use chiral symmetry to guide extrapolations in m u and mj. 

Chiral perturbation theory is an expansion about the chiral limit, M = 
0, so I first discuss massless QCD. It is expected in this theory that the exact 
chiral symmetry is spontaneously broken by the vacuum. This is based on 
an accumulation of evidence about QCD itself: the lightness of the w, K and 
r] are consistent with their being pseudo-Nambu-Goldstone bosons (PGBs) 
of the broken approximate chiral symmetry of real QCD; the absence of 
(approximate) parity-doubling in the hadron spectrum, e.g. m^(P = +) 7^ 
mjv(P = — ), as would be required by an unbroken (approximate) chiral 
symmetry; and the accumulated successes of XPT, especially in the SU{2) 
sector. It is also expected that the order parameter for chiral symmetry 
breaking is the condensate: 6 

{QQ} = ((QlQR + QrQl)) ~ A| CD ^0, q = u,d,s. (5) 

The vector symmetry is not expected to be spontaneously broken, based 
on the presence of approximate SU (3)y multiplets in the hadron spectrum, 
and on theoretical considerations 23 . This implies that the condensates are 
equal in the massless theory, (uu) = (dd) = (ss). 

The form of the condensate in eq. (5) is a convention. There is in 
fact a manifold of equivalent vacua related by chiral transformations, and 
parameterized by the orientation of the condensate in flavor space: 

n« - (QL,i, a ,cQ Rlj , a , c ) ^u L nu f R . (6) 

Here I have shown the summed Dirac and color indices explicitly (a and 
c, respectively), since Q and Q are in the opposite order from usual. This 
order makes the flavor matrix structure (indices i and j) transparent, and 
is particularly useful when discussing PQXPT below. The assumption of 
unbroken vector symmetry implies that Ojj = lu 5ij must lie in the manifold, 
and so the general point is ljUlU^. In this language, chiral symmetry 
breaking is equivalent to u> being non-zero, for then the vacuum is left 
invariant only by a subgroup H of G: 

SU(3) L x SU(3)r SU{3) , (7) 

G H 



e There has been some controversy over whether the condensate is the dominant order 
parameter 21 , but this standard picture is now strongly favored 22 , and is supported by 
lattice calculations of (qq), so I will accept it here. 
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The nature of H is simplest using the conventional vacuum orientation, 
Ctij — wSij, with u> = —(qq). Then H = SU(3)v (with Ul = Ur), while 
the axial transformations with Ul = Ur are broken (J7 = uj — > ujU^). 

Goldstone's theorem then implies that there are 8 massless Nambu- 
Goldstonc bosons (NGB — labeled 7r b , and corresponding to the ir, K, and 
i]), each coupled to one of the eight broken axial generators. For the con- 
ventional vacuum orientation, one has 

(Ap)IQ7M7 5 T a Q(0)|0) = -if^S ba , (8) 

with T a being SU(3) generators. The masslessness of the NGB follows from 
the fact that rotations of the condensate with four-momentum p cost zero 
energy as p — > 0, in which limit they become global rotations. 

We now have the ingredients with which to construct an EFT: a separa- 
tion of scales (mcB = compared to the scale of other hadron masses — m p , 
m' , JTiproton, etc., with mi ia d « 1 GeV), and a knowledge of the symmetries 
of the underlying theory. The EFT will contain only NGB as dynamical de- 
grees of freedom, and should be valid as long as pqb -C Whad- The spurion 
field M should also be added to include the effects of quark masses/ 

Representing NGB fields is probably the most conceptually non-trivial 
step of the construction, because (a) the underlying theory is written in 
terms of quarks rather than mesons (one is not just "thinning" degrees of 
freedom, one is also changing basis) and (b) the choice is not unique 24 ' 25 . 
Because of point (b), the strategy is simply to find any representation which 
works — it turns out that by field redefinitions one can then switch to other 
choices as desired. 

Since there are no precise rules to follow, it is useful to proceed by anal- 
ogy. To this end, I recall the canonical example of spontaneous symmetry 
breaking: a complex scalar field with a "Mexican hat" potential 

y = V^ + A(^) 2 /2- (9) 

There is a G = U{1) phase symmetry, <p — * e M 0, which is spontaneously 
broken if /i 2 > 0, for then there is a non-zero vacuum expectation value 
(VEV) | (4>) | = v = J /i 2 / A. The vacuum manifold consists of the phase of 
((f)), and is thus U(l). The symmetry breaking is G — ► H = 1, implying 
a single NGB corresponding to phase rotations. The EFT for this massless 
mode is analogous to that we wish to write down for the pions in QCD. 



f The EFT can also contain static sources, representing heavy particles with m > mh a d, 
off which the NGBs can scatter. These can represent the interesting cases of vector 
mesons, baryons, or heavy-light hadrons. 
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The advantage of this theory, compared to QCD, is that we can directly 
construct the EFT by integrating out heavy fields, as long as A is small 
enough that we can use perturbation theory. Having done so, we can see 
how the EFT could be obtained using the symmetries alone, and use this 
to guide the construction for QCD. 

I decompose the field in a way which differs from that used, say, in 
studying the Higgs: 4>{x) = ve p ^e ze ^ rather than (f>(x) = v + h(x). This 
choice picks out the NGB degree of freedom, 8, explicitly, and allows the 
phase symmetry to act linearly: e ie ^ — > e la e lfl ' I l If we integrate out the 
heavy radial degree of freedom, p, we obtain an effective Lagrangian in 
terms of e 10 . But we do not need to do any work to determine the general 
form of £ e ff — we need only require locality, reality, Euclidean and 17(1) 
invariance. The result is 

£eff = c 2 d^(e ie )d^e- ie ) + c 4 d^e w )d^e-* 9 )d„(e w )d„(e- w ) + ... , (10) 

where Cj are unknown constants. 8 Terms without derivatives on every factor 
of e ±l6 can be brought into the form shown (up to total derivatives) using 
e i9 e -i9 _ j anc j |_ ne a belian nature of the group. The result is a massless 
NGB having interactions proportional to p*. It is an interesting exercise 
to check the latter result in perturbation theory using the conventional 
expansion in terms of h(x) — the p A arises from cancellations between non- 
derivative interactions. 

We learn two things from the U(l) example. First, to use the exponen- 
tial of the "pion" fields, since it transforms linearly under G, and simplifies 
the implementation of the symmetries. Second, that a "fixed radius" form 
(\e t6 \ — 1 above) automatically includes only the NGB, excludes heavy de- 
grees of freedom, and enforces the spontaneous breakdown of the symmetry 
(i.e. U(l) is broken for any value of (9)). 

The QCD analog of {<p) is the condensate flij of eq. (6) — both map out 
the corresponding vacuum manifolds. The analog of fixed length angular 
fluctuations is obtained by promoting Cl/u) to a dynamical field £ i:) (x), 
corresponding roughly to fluctuations in the condensate. Just as VL/lo is in 
SU(3), so is S, and the chiral transformation properties are the same: 

£(z) E SU(3) : £(&) _^ U L T,{x)U ] R . (11) 

Any VEV of S breaks G to H = SU(3), leading to the desired number of 
NGBs. The fixed radius of E (i.e. ££t = vjtvj = i) implies that the only 



s Since U(l) is abelian, C e g can be simplified to C2(<9 M 0) 2 + C4[(9 (J 6) 2 ] 2 . I do not pursue 
this as similar manipulations fail for the non-abclian chiral groups relevant for QCD. 
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degrees of freedom in E are the NGBs. For example, if (E) = 1 we can 
expand as 



E(z) = cxp(2ffl(x)/7) = cxp{2in a (x)T a /f) , a=l,8, (12) 



in terms of the eight "pion" fields and a constant / to balance dimensions. 
Note that although E transforms linearly, this is not the case for the pion 
fields (e.g. Sn a contains terms with any odd powers of 717,). Thus construct- 
ing £ e ff in terms of the pion field directly would be very difficult. 

2.3. Constructing the pionic effective Lagrangian 26 
2.3.1. Building blocks for £ e ff 

We are now in business. The ingredients are E and Et, as well as the 
spurions Af and Aft. I recall their transformation properties under the 
chiral symmetry group G = SU(3)l x SU(3)r: 



E-EfiEEft, ^^U R ^U f L , M^U l MU r , Aft - U R M^U R . (13) 



It is useful to construct objects which transform solely under the left-handed 
(LH) or right-handed (RH) sub-groups (and which I call respectively LH 
and RH building blocks), since they simplify enumeration of operators: 11 



where I have repeatedly used the unitarity of E. From the fact that 
dct(E) = 1 one learns that and are traceless, e.g. 

= ^(dctE) = cycxptrlnE) = dctE tr^EE" 1 ) = -tr(L M ) . (14) 

Thus and ("Weyl derivatives") are elements of the Lie algebra, su(3). 

Another symmetry of QCD is parity. Since E ~ QMr and Et ~ qpq L , 
the transformations in the EFT are [with xp = (—a?, x±)] 

E(x) Et(xp), Af <-» Aft, Li(x) -R l {x P ), L 4 (x) R±(x P ). (15) 

If one expands about (E) = 1, as in eq. (12), then the pion field transforms 
as H(x) — > —H(xp). There are also C and T symmetries, which I do not 
show explicitly. 



LH: L„ = Ed M Et = -d M EEt 
LH: AfEt — > U L {MT^)UI , 
RH: = E^E = — c^E^E 
RH: AftE — ► U R (M^)U R , 



-it _^ UlL^UI 
EAft — > U L (EMi)Ul 
-Rl — U R R,U R 
EtM — ► U R {^M)U R 



h I use the convention throughout that derivatives only act on the objects immediately 
to their right. The arrows implicitly denote transformation under G. 
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I now enumerate terms which are local, real and satisfy the symmetries 
of QCD. These are just products of the building blocks above, with LH and 
RH blocks combined separately into traces to make them invariant under 
G. In fact, for the terms I display, one need only use LH building blocks 
as the results equal their "parity conjugates" (p.c). Since in the end we 
expand in powers of momenta, it is useful to classify terms according to 
the number of derivatives. Similarly, as M is treated as small, one should 
classify according to the number of spurion insertions. We will see that one 
should usually count two derivatives for each spurion. 

There are no non-trivial terms without derivatives or spurions: these 
would be constructed from tr[(EE^)™] or powers or detE, but both are 
constants. Euclidean invariance rules out a single derivative. The only 
independent term with two derivatives is 

1. tr(L M L„) - -tr(^S^St) = tr^i^), 

while the only term with no derivatives and one mass insertion is 

2. tr(M£t) + tr(EAft) . 

There are five terms with four derivatives: 

3. [tr(L„L M )] 2 

4. tr^L^tr^L,) 

5. tr{L tl L^L l/ L l/ ) [not independent for two light flavors]] 

6. tr(L tl L u L^L l ,) [not independent for 2 or 3 light flavors]] 

7. The Wess-Zumino-Witten (WZW) term involving e Miy(9cr 27 ; 

two terms with two derivatives and one mass insertion: 

8. tr(Lf,L^ tr(MEt + EAft) 

9. tr(L^[MEt + EMt]) ; 

and three terms with two mass insertions: 

10. [tr(A#Et + EMt)] 2 

11. [tr(MEt - EMt)] 2 

12. tr(MEtME+ + M^EM^T,) . 

Each of these terms appears in L c g with an independent unknown coeffi- 
cient, except for terms 5 and 6 which, as noted, arc not independent for 
certain chiral groups. I will not discuss the interesting structure of the 
WZW term, as it is complicated, and does not contribute to the simple 
processes considered here. I will also not continue the enumeration beyond 
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this point. This has been done as part of next-to-next-to- leading order 
(NNLO) calculations 28 , but is beyond the scope of this introduction. 

With the enumeration of operators in hand, I turn to the predictions. 

1 will assume a power counting with d 2 <~ M and justify this a posteriori. 
At leading order (LO) we have (the superscript on C counting derivatives): 

4 2 ) = L\x (fyEfyEt) _ Z^ tr ( M £t + SMt) , (16) 

where the unknown "low energy constants" (LECs) have been given their 
standard names / and Bo- Since we are expanding about massless QCD, 
the only scale that can appear is Aqcd, so we expect / <~ B <~ Aqcd- Up 
to this stage, M is a complex spurion field. To include the effects of quark 
masses we set it to its physical value: M — > M = diag(m„, m^, m s ) = Mq. 
This makes the potential — the second term in eq. (16) — depend on the 
direction of E, and we must determine its VEV by minimizing this potential. 
In other words, the quark masses, which break the chiral symmetry, pick 
out a preferred direction in the vacuum manifold. If all quark masses can 
be chosen positive (as is apparently the case in reality) , then one finds that 
the VEV is (E) = 1. I stress that this result is convention dependent: we 
could choose M — UlMqU^, in which case (E) = UlUr. 

2.3.2. Brief aside on vacuum structure 

It is instructive to consider the vacuum structure of two flavor theory in a 
little more detail. Then we can write E = exp(i9n ■ r), implying E + E^ = 

2 cos 6 x 1, and thus V (2) oc — tr(M) cos#. This is minimized by (E) = 1 if 
trM > 0, and by (E) = -1 if trM < 0. Note that at LO all that matters 
is the average quark mass trM/2; the difference m u — rrid does not enter. 
There is a first order phase transition when trM changes sign at which 
the condensate flips sign but maintains its magnitude. Note, however, that 
the physical theory is the same for both signs of trM: one can go between 
them with a chiral rotation satisfying UlU^ = — 1. I will discuss how 
discretization errors effect this transition in sec. 3.9 below. 

With three flavors (or any odd number), the situation is more compli- 
cated because E = — 1 is not an clement of SU(3). Changing the sign 
of M leads to a different theory (one with the original M plus a 9— term 
with 8 = tt). Without going into details, I show in Fig. 3 the phase struc- 
ture if m s is fixed and positive while the other two masses change. The 
shaded region is where CP is spontaneously broken. In the real world we 
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Figure 3. Phase structure at LO with 3 flavors and fixed m 3 (from Creutz 29 ). 

are very likely in the right-hand upper quadrant, but it is striking that such 
interesting physics lurks not far away and is contained in the LO potential. 

2.3.3. Properties of pseudo-Nambu-Goldstone bosons at leading order 

Assuming positive quark masses so that (£) = 1, we can study pion prop- 
erties by inserting eq. (12) into Z> 2 ' and expanding: 

4 2) = tr(^na M n) + ^tr([n, d„n][n, d^u]) + o([dn] 2 n 4 ) 

2,13 

+2B tr(Affl 2 ) - — |tr(Mn 4 ) + OfAffl 6 ). (17) 
3/ 2 

We can now understand the choice of factors multiplying the kinetic term 
in (16): they are chosen so that the pion kinetic term is correctly normal- 
ized [using tr(T a T b ) = 5 ab /2}. If M = [the first line of (17)], the pions 
are massless as required by Goldstone's theorem, and their interactions 
all involve derivatives, as exemplified by (<9IT) 2 n 2 term shown. Note that 
the non-abelian nature of the group allows there to be interaction terms 
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with two (as opposed to four) derivatives, in contrast to the U(l) example 
described above. This interaction term is non-renormalizable, as are sub- 
sequent terms involving more pion fields, with the dimensions balanced by 
factors of /. 

Including M, the pions become massive pseudo-Goldstone bosons 
(PGB). XPT predicts at LO that the pion mass squared is proportional 
to the quark mass. The explicit form is simplest for degenerate quarks: 

= 2Bom q . This answers the following potential puzzle: how can physi- 
cal quantities like be related to scheme- and scale-dependent quantities 
like m q l The answer is that Bq cancels the scheme dependence in m q . 
This works for all the terms in the second line of (17), as they contain the 
common factor BqM. For this reason it is useful to give the combination a 
name, specifically \ q = 2B m q . 

The spurion terms also give rise to higher-order, non-renormalizable 
interactions among pions. Note that all vertices in contain an even 
number of pions, because is invariant under II(a;) <-> — H(x). This is 
an accidental symmetry, which does not correspond to a symmetry of QCD 
(note that it differs from parity), and is broken by NLO terms in XPT. 
More precisely, it is broken by the WZW term, which allows, for example, 
interactions between five PGB. 

LO XPT makes a number of predictions. First, it is clear that once / and 
X have been determined (e.g. from PGB masses and scattering amplitudes), 
all higher order vertices are predicted. But there are also predictions from 
the structure of the quadratic and quartic interactions alone. The former 
give relations between PGB masses, that latter between pion scattering in 
different channels (e.g. / = 0, 1, 2 in the two-flavor theory). I will discuss 
the mass relations here. 

First we need to place the physical particles in the pion matrix n. This 
can be done using the vector symmetry corresponding to diagonal phase 
rotations, U(l)v, u x U (l)v,d x U(l)v,s € U(3)v, which is unbroken by quark 
masses, and under which the pion field transforms linearly: II — > Uy^lUy. 
The tt^~ , K ± , K° and K° are charged under these symmetries 1 (e.g. the n + 
has u-ness +1 and d-ness —1), and so live in definite off-diagonal positions 
in II. Isospin then determines the position of the ttq, and orthogonality 



'In the following, I refer to all such particles as "charged" , having in mind this generalized 
definition rather than electric charge. 
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that of the r\. Including normalizations, the result is 

/tt" | r, 2± k+ \ 

n = S -%±ib S • ( 18 ) 

Inserting this into — 2B a tr(Mn 2 ), we find that charged particle masses are 
proportional to the average mass of the quarks they contain: m 2 q . q . — 
B (mi + rrij), i ^ j. While there are no predictions (masses of three pairs 
of CPT conjugate mesons are given in terms of three quark masses), one 
can determine quark mass ratios from the experimental PGB masses, e.g. 

m 2 K+ +m 2 K0 mi + m s ( m u + m d \ 

— ^ — 9 — = — z ~ 13 [ me = . (19) 

2ml + 2me \ 2 J y ' 

This implies m s /me w 25, up to NLO XPT and electromagnetic (EM) 
corrections. This is how we know that the strange quark is so much heav- 
ier than the up and down quarks. The corresponding determination of 
(m u — md)/me (or equivalcntly m u /md) from {m K+ — mj c0 )/m 2 + fails, 
however, since the NLO corrections (A(m u /m d ) oc m s /Aq CD 30 ) and EM 
contributions are potentially as large as the LO term in XPT. A determina- 
tion of m u /md can be achieved by a direct lattice calculation of the meson 
masses using non-degenerate quarks together with an estimate of the EM 
contributions. The most accurate results at present 3 are m u /m,d = 0.43(8) 
and m s /mt = 27.4(4). 

The first predictions of XPT occur in the neutral sector. The tt° and rj 
mix, but with an angle 9 <~ (m u — md)/m s that (despite the uncertainty in 
m u /md) we know to be very small. Thus 

mlo = ml + + O(0 2 m 2 K ) + . . . , (20) 
m% =(2[m 2 K+ +m 2 KQ ]-ml + )/Z+0{e 2 m 2 K ) + ... . (21) 

(548 MeV) 2 (566 MoV) 2 

Both predictions are well satisfied. The first, that of approximate isospin 
symmetry for the pions, holds not because m u /md ~ 1 (which is not true), 
but because (m u — rrid) /Aqqd -C 1. The second is the famous Gell-Mann- 
Okubo (GMO) relation. 

2.3.4. Lessons for lattice simulations 

(I) Leading order XPT works to <~ 10% in GMO relation, despite the fact 
that this is a three-flavor relation involving the strange quark. This gives 
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hope that three-flavor XPT can be used to extrapolate from the lattice 
simulations that are presently being undertaken, which have mP hys /2 > 
w^ at > 2m phys and m' s at « m phys . The issue is whether the NLO corrections 
are generically this small, and I return to this below. 

(II) Assuming the validity of XPT, the ratio m 2 + /m q determines the phys- 
ical Bo (in whatever scheme the quark mass is defined in) even when the 
quarks are degenerate and have masses differing from (usually larger than) 
their physical values. Such a determination is an example of obtaining 
physical results from simulations with unphysical parameters. It works as 
long as there are N = 3 dynamical quarks: Bq (like all LECs) depends on 
N, and so one must simulate with the same number as in QCD. Of course, 
one also needs NLO corrections in XPT to be small. 

(III) The isospin limit, m u = md, is close to physical QCD. Working in 
this limit simplifies simulations, e.g. by reducing the number of adjustable 
parameters, and by canceling disconnected contributions to neutral corre- 
lators such as in the 7r° propagator: 




The error one makes in hadron masses by setting m u = is generically 
~ (m u — m^/AQCD ~ 1%, comparable to those from EM contributions. 
Of course, once one can attain 1% accuracy in the isospin limit, further 
improvement requires the calculation of disconnected and EM contributions 



2.3.5. Power counting in XPT (M = 0) 

I now turn to the questions of power counting and predictivity of non- 
renormalizable theories: how are contributions ordered, and by what factor 
are higher order terms suppressed? As already noted, the ordering turns 
out to be in powers of momenta-squared and mass insertions, so the NLO 
effective Lagrangian, C^ 4 \ contains the terms proportional to d 4 , d 2 M 
and M 2 enumerated above. Setting M = to simplify discussion, and 
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expanding, one finds, schematically: 

~ fMWW) ~ (an) 2 + ^p^- + ... (22) 



"(an) 4 n 2 (sn) 4 



(23) 



where Lql are unknown dimensionless LECs, first enumerated by Gasser 
and Leutwyler 26 . Taking irir scattering as an example (with, say, dimen- 
sional regularization to avoid power divergences), the contributions up to 
quartic in momenta are: 



r (2) . 

^trcc ■ 



x 42, ^kc— m 



£ i-ioo P : )^ZI>^' v ><0 1 \ ~ (f) 



ln(p 2 /p 2 ) 



where I have shown only a representative loop diagram, and the positions 
of the "p's" in the diagrams indicate whether they refer to external or loop 
momenta. Contributions from at tree-level, and at one-loop, are 
proportional to p A (up to logs), where p is a generic external momentum. 
These terms are suppressed relative to the tree-level contribution from 
by p 2 /f 2 (up to logs). 

It is straightforward to generalize from this example to a power-counting 
scheme, using the fact that each pion field brings with it a factor of l/f. 
One finds that, for all processes involving PGB, the expansion parameters 
are (up to logs) p 2 /f 2 and (if we reintroduce M) x/f 2 ~ m2 /f 2 - At 
LO contributions come from C[ 2 l c , at NLO from c[^l c and £i_ loop , and 

at NNLO from 4rL (£ (2) + £ (4) )i-ioo P and C (2 \ oop , etc.. These are, 
respectively, "trivial" , "easy" and "hard" to calculate (though the latter is 
done), with NNNLO being "very hard". 

We can now understand the nature of predictions from XPT. At each 
order there are a finite number of LECs (2 at LO, 10 at NLO, 90 at NNLO). 
We pick an order to work at, say NLO. We determine the LECs from the 
appropriate number of physical quantities, and then make predictions for 
all other quantities. These predictions will be accurate up to truncation 
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errors (of NNLO size in our example) . In the continuum, these errors can 
only be estimated. On the lattice, we can attempt to fit them as part of 
the extrapolations. 

The example discussed above also illustrates the general relationship 
between terms analytic and non-analytic in p 2 and M. Non-analytic terms 
(often called "chiral logs") come from pion loops and involve lower order 
vertices (e.g. LO vertices in the NLO calculation of the example). In par- 
ticular, they do not involve the LECs of the order being worked at, and in 
this sense are predictions. They do, however, depend on the renormalization 
scale /i, as exemplified by the p ln/i term in the example. A physical result 
cannot depend on /i, and indeed the dependence can be canceled by renor- 
malizing the LECs: Lql — > Lql([i). Renormalization theory shows that 
all divergences can be removed in this way as long as all terms consistent 
with the symmetries are included in £ ff, which was, of course, our starting 
point. Thus the LECs can be thought of as counterterms which contain 
our ignorance about the short distance parts of loops, crudely speaking the 
parts with \p\ > /i. It is then natural to set /i w rrihad, since Whad ~ 1 GeV 
is the scale above which we know the EFT to be ineffective. 

One can argue for a natural value for this "cut-off" scale from within 
XPT itself, and at the same time estimate the LECs at this scale. The above 
example should make plausible the result that the LECs satisfy renormal- 
ization group equations of the form (ILgl / dln(fi) = 0(1) x (Att)~ 2 . The 
(47r) -2 comes from the four-dimensional loop integral. Now we do not know 
the value of the Lql at the natural matching scale \i ~ Whad, nor do we 
know what this matching scale is within a factor of two or so. But we 
do know how the LECs change with /x, |Lgl(2/x) — Lcl(^)\ ~ l/(47r) 2 , 
so that even if Lql ~ at a possible matching scale, it would be of or- 
der l/(47r 2 ) at another. This motivates assuming that the natural size is 
\Lgl(ij>)\ ~ l/(47r) 2 . This can be generalized into a self-consistent scheme 
to all orders. Of course, this argument does not rule out LECs with larger 
magnitudes, but it turns out that those LECs that have been determined 
from experiment have the expected magnitude. 

If we make this assumption, then the NLO contributions are suppressed 
relative to the LO by Lip 2 /f 2 ~ P 2 /A 2 (and m 2 /A 2 when we include M), 
with A x = 47r/. We will shortly see that / w f % , in which case A x = 
1.2 GeV. This value is completely consistent with our original expectation 
that XPT would break down for \p\ > mhad- 
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2.3.6. Lessons for lattice simulations (continued) 

(IV) We can use XPT to extend the reach of lattice calculations to multi- 
particle processes. Simulations can only access properties of single or two 
particle states, with the latter being a challenge away from threshold. How- 
ever we can use lattice calculations to determine the LECs, and then use 
XPT to calculate inaccessible processes. For example, one can determine 
A(K — ► 7T7r) using unphysical, but more accessible, matrix elements. 

(V) The downside of relying on XPT is the inevitable presence of a trunca- 
tion error. If we want to estimate this error, we need fits including the next 
higher order terms. For example, to reliably determine the Lql discussed 
above, one must do fits including, at least approximately, NNLO terms. In 
fact, the full NNLO expression in PQXPT is available for PGB masses and 
decay constants 31 . 

2.3.7. Technical aside: adding sources 

Electroweak currents can be used to probe PGBs. For example, the weak 
leptonic decay 7r — ► tU is proportional to the matrix element of the axial 
current, (0|^4^|7r), itself proportional to the pion decay constant f n . It 
is thus important to incorporate vector and axial currents into the XPT 
framework. Other operators are of interest too, and I discuss here the 
scalar and pseudoscalar densities. The aim is to map such operators at the 
quark level into operators in the EFT in such away that matrix elements 
agree, up to truncation errors in XPT. This mapping should be based only 
on the symmetries of the operators and the action. 

A convenient method for doing the mapping is to start with the QCD 
action including position-dependent sources for left and right-handed cur- 
rents (Ifj, and r M , respectively) , and for the densities (s and p) : 

£qcd = Q L {P- i l^^)QL+Q R {p-il l ,r p )Q R +Q L (s+ip)QR+Q R {s-ip)QL ■ 

(24) 

Functional derivatives of the partition function Zqcd(^, t>, s,p) with re- 
spect to the sources (which are hermitian matrices in flavor space) bring 
down the corresponding currents and densities. Note that s and p are just 
a (position-dependent) rewriting of the mass spurion, M — > s + ip. After 
derivatives with respect to the sources have been taken, they are to be set 
to zero, except for s + ip, which is set to M) 



J For the standard diagonal mass term, s is set to M and p to zero. 
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One nice feature of Zqcd(^,?>, s,p) is that it represents what one can 
actually calculate in QCD — correlation functions of gauge-invariant opera- 
tors. It allows one to access PGB physics, because the axial currents (and 
pseudoscalar densities) couple to pions. For example, a 4-point correlator 
of axial currents, analytically continued to Minkowski momenta, can be 
amputated and then evaluated at the pole positions so as to obtain the 
pion scattering amplitude using the LSZ formula. Similarly one can obtain 
matrix elements of currents and densities between PGBs (or other states). 

Thus it is natural to phrase the matching of QCD with the EFT in 
terms of this partition function 26 

Z x {l^,r^,s,p) = Zqqd (^,r>,s, p) + truncation errors (p n , m v <C A x ) . 

(25) 

This simply says that the correlation functions of the two theories must 
match for momenta such that PGBs give the dominant contributions, up 
to errors due to the truncation of XPT. The matching ensures that masses, 
scattering amplitudes and matrix elements agree in the two theories. 

This is nice packaging, but to give it content we need to learn how to 
add sources to Z x . This can be done by generalizing the spurion method 
to enforce a local SU(3)l x SU(3)r symmetry. £qcd in (24) is invariant if 
Z,r^ transform as gauge fields, 

I^UlI^uI+iUlO^uI, r^U R r^ R + iU R d^ R (26) 

(with Ul.r = Ul.r( x )), while s and p transform as before, \ = 2Bq(s + 
ip) — > UlxU r - Here I have introduced the convenient matrix \- Note that 
this invariance only works because the left and right-handed currents are 
Nocther currents for the corresponding symmetries. One would like to argue 
that, in order to satisfy cq. (25), C x must also be invariant under local chiral 
transformations. This turns out to be correct, but the argument, given by 
Leutwyler 32 , is subtle. First one must deal with anomalies: Zqqb is not 
invariant under local chiral transformations, but has a variation which is 
a known functional of the sources alone. This must be matched by the 
variation in Z x , which in turn requires the presence of the WZW term 
in C x . This allows the remainder of C x to be invariant, but this is not 
automatic. Leutwyler shows that it is possible to bring it to an invariant 
form if one uses the freedom of adding total derivatives (which do not 
change the action) and changing variables. 

The invariance of C x under local transformations can be accomplished 
in the usual way: by replacing normal derivatives with covariant derivatives 
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which transform homogeneously, e.g. 

= fyE - ii M S + zSr M -» U L {D^)U{ . (27) 

Note that this fixes the normalization of the Z,r M terms, implying that 
the normalization of the currents in the EFT is known. k This is possi- 
ble because the currents generate a non-abelian group, whose algebra is 
schematically [L, L] ~ L, in which normalizations are fixed. Terms without 
derivatives, e.g. mass terms, are automatically invariant under the local 
symmetry, so do not need to be changed. The inclusion of sources also 
allows new types of terms in C x , as will be seen shortly. 

2.3.8. Final form of chiral Lagrangian 

We now have the ingredients to construct the EFT through NLO including 
sources. The LO result differs from the earlier form (16) only by — > 
and 2B M -> X - 

= ^tr (i?„££>„Et) - ^tr( X St + E % t) , (28 ) 

The addition of the sources allows us to match currents with QCD, e.g. 
equating 5Z / 5l p (x)\i =r=p= o ]S= M in QCD and XPT implies 

Q Ll ,T a Q L ~ (if 2 /2)tr(T a Zd^) = -(f/2)d^ a + .... (29) 

This result can also be obtained using the Noether procedure. It allows 
us to determine /: from the vacuum to pion matrix element one finds 
/ = / w w 93 MeV. Similarly, we can relate B to quark-level quantities. 
Taking SZ/Ss(x)\ l=r=p=a , s= M gives 

QQ ~ -(/ 2 Bo/2)tr(S + St) = -Nf 2 B + 0(n 2 ) , (30) 

the VEV of which gives (qq) = —f 2 Bo (with q = u, d, or s). This is a place 
where the lattice can contribute, since the condensate is not experimentally 
measurable but can be directly calculated on the lattice. Alternatively, 
since the PGB masses give the combination B m, as seen above, a lattice 
determination of the quark masses gives another result for B - Comparing 
the two determinations tests the accuracy of XPT at LO. 

At NLO the addition of sources leads to new terms. For three flavors, 
the enumeration in sec. 2.3.1 gave 8 independent terms plus the WZW 



k One could also obtain the correct normalization in the absence of symmetry breaking 
by matching the Noether currents in the two theories. 
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term. Sources allow two additional terms involving the field-strength ten- 
sors L, (constructed from the source gauge fields I, in the usual 
way), as well two terms involving sources alone. The latter are multiplied 
by so-called high-energy coefficients (HECs). In sum, one has 

£ (4) - -Li [tr(D^D^)] 2 - L 2 ti(D^D,^)tr{D^D^) 
+L 3 tr(D^D^D^D^) 

+L 4 tr(^St^S)tr( X t S + £ f x) + L 5 tv(D^D^[ X ^ + ^x]) 
-L 6 [tr( X ^ + ^x)] 2 -L 7 [tr^E-E^)] 2 - L & tr^Sx^+p.c.) 
+L 9 itiiL^DfjYiDvYJ + p.c.) + L w tr(L A1 ^Si? Ally E t ) 
+H 1 tr(L^L^ + p.c) - H 2 tr(x f x) + £\vzw , (31) 

There is also a term oc tr(Z? M x^L> M E), which contributes only to the mass 
dependence of the currents, but it can be removed by a change of variables 
for E, and is thus redundant. The dimensionless L, are the well-known 
"Gasser-Leutwyler coefficients". A subset of them can be determined ex- 
perimentally to good accuracy and a different subset is straightforward to 
determine on the lattice, as illustrated below. 

What about the HECs, Hi.-p. Since these multiply terms which do not 
involve S they give contact terms in correlation functions (e.g. Hi gives a 
contribution to C P (x) = (Qj 5 T a Q(x)Qj 5 T a Q(0)) proportional to H 2 5(x), 
and also contributes to the mass dependence of the condensate). Thus they 
do not contribute to physical scattering amplitudes or matrix elements. 
Why does one need them? They are required if one wants to describe the 
mass dependence of the condensate (which can be calculated on the lattice) 
within XPT, and in order that quark level Ward identities are satisfied (e.g. 
m / d 4 xCp(x) oc (qq))- 1 For the matching to work, the HECs must depend 
on the regulator used for QCD. For example, if QCD is regulated using a 
lattice, then H2 oc 1/a 2 because it must reproduce (qq) oc m/a 2 . This is 
in stark distinction to the LECs which do not depend on the underlying 
regulator (leading to their different name), and are physical parameters. 



'One might be concerned that, because the short-distance behavior in QCD and XPT are 
different (operators having different dimensions in the two theories), the whole notion of 
matching the partition functions with sources is flawed. How can correlators be matched 
when the operator product expansions differ? The answer is that the matching is done 
only for \p\ < A x , so one avoids the short-distance regime in the underlying theory. 
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2.4. Examples of NLO results 

With £( 2,4 ) in hand, it is straightforward to calculate NLO results for 
physical quantities. For example, the pole in the two-point function of 
the left-handed current gives the PGB mass, while the residue is propor- 
tional to /pq B (just as in lattice simulations). Recall that the LO result is 
to pgb o = (Xqi +Xq2)/2 = 2B (m q i+m q2 )/2. At NLO, there is a tree-level 
contribution 

<W-gb~ ^ ~ X Lf~x(lGn 2 L)?^, 
and a 1-loop contribution 

I quote the final result for the tt^ as an example 26 : 



g or t 

"4± = Xt{ l + 1 5l(2L 8 ~L 5 )xe + (2L 6 -L 4 )(2 X e + Xs)} + 1 '~ : ^" 



valence 



6 

logs 



(32) 



where xi — (Xu + Xd)/2, and the chiral logs are (using dim. reg.) 



9 / ? 

mi , / mf 



L -=^Kf)- L «=4 u \v}- (33) 

The /i dependence is absorbed by the Z^, as discussed above. Looking 
ahead to the discussion of PQQCD, I have separated the analytic terms 
into those that arise from the masses of the quarks which compose the pion 
("valence") and those of the quarks in loops ("sea"). How this is done will 
be explained in sec. 4. Within QCD itself this separation is not useful, as 
corresponding valence and sea quarks have the same masses. 
Another example is the ratio of decay constants, which is 

^ = 1 + ^ (L 5 )(Xs-Xi) + \L, \l k - \L V . (34) 

valence , 

logs 

This result allows L5 to be determined from experiment. The result depends 
on the choice of /x, a conventional value being [i = m p . 

These results illustrate the general structure at NLO: there are cor- 
rections analytic in x an( i dependent on the masses of valence and sea 
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Figure 4. Typical behavior of m^/mj as a function of mi/m s at NLO. 

quarks, and chiral logs that are non-analytic in \. The expansion parame- 
ter TOp GB /A^, is clear in the logs, but is obscured in the analytic terms by 
the convention for the Lj (which are numerically of size < l/(167r 2 ).) 

2.4.1. Lessons for lattice simulations (continued) 

(VI) Non-analytic terms become important at small masses. To illustrate 
this, I plot m^±/mi versus mi/m s in Fig. 4, with mi the average light- 
quark mass. I hold the strange mass fixed at m s — 0.08 GeV, and use 
values for the LECs that are representative of those from XPT analyses: 
/ = 0.093 GeV, L 5 = 1.45 x 10~ 3 , L 8 = 10~ 3 , L 4 = L 6 = (with fj, = m p 
here and below). The curve's lower end is approximately the physical point. 
At LO the XPT prediction is a constant. The analytic NLO corrections 
lead to linear dependence on mi, and the logs to curvature. Clearly, to 
obtain 1% accuracy one must simulate down to mi/m s w 0.1 in order to 
see and fit to the predicted curvature. 111 This has been achieved in the 
MILC simulations. In my opinion, seeing curvature consistent with XPT 
predictions is a necessary check on lattice techniques. Similar comments 
hold for fx/ fn, plotted in fig. 5 using the same parameters except now / = 
0.085 GeV (chosen so as to better match the experimental value at the lower 
end of the curve). Here the linear terms are larger, but an extrapolation 



""Extrapolation can be simplified in some cases by considering "golden (silver) ratios" 
in which the chiral logs completely (partially) cancel 33 . By contrast, for some quantities 
the chiral logs are enhanced, e.g (r 2 ) n ~ ln(m^//i 2 ) rather than m 2 ln(m^). 
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accurate to a few percent still requires inclusion of the curvature. 




Figure 5. Typical behavior of fx/fw as a function of m;/m s at NLO. 

(VII) The results further illustrate the utility of the lattice for obtaining 
LECs. In nature one can vary the valence content by considering different 
PGBs, while the sea content is fixed. Thus L4 and Lq are not accessible 
using the results above. On the lattice, however, one can change the sea 
quark masses and thus determine these LECs more easily. Using PQ sim- 
ulations (varying valence and sea masses independently) further simplifies 
the determinations, as will be discussed in sec. 4.5. 



2.4.2. Volume dependence from XPT 

For single particle matrix elements, PGB loops loops give the leading correc- 
tion due to the use of a finite spatial volume. To obtain the leading volume 
dependence, one simply replaces the momentum integrals with sums, e.g. 

(q^) ~* Iq (g 2 +m^ G ) ~ * hi ^q=^n/L (q 2 +m 2 NG ) ' 

This is now done routinely when fitting lattice data. Figure 6 illustrates 
the rapid growth in finite volume shifts as the quark mass is reduced. 

Unfortunately, recent work suggests that an accurate estimate of volume 
corrections requires the inclusion of at least the dominant part of the two- 
loop contributions 34 . The requisite calculation has only been done for a few 
quantities, so for others one may be forced to work in volumes large enough 
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that finite volume effects are negligible. For determining such volumes one- 
loop results should be a reliable guide if used conservatively. 

If the quark mass is reduced at fixed box size satisfying L 3> 1/Aqcd, 
one eventually enters the so-called "e— regime" where itingL < 1. Here the 
pion propagator is completely distorted by finite volume effects. It turns 
out, however, that one can still use XPT to predict the form of correlation 
functions 35 . I will not discuss this regime further, but note that there is 
an ongoing effort to determine the LECs of QCD (including electroweak 
interactions) by comparing the results of simulations in the e— regime to 
the predictions of XPT 36 . 



2.4.3. Convergence ofXPT 

I have only scratched the surface of calculations in XPT, which have been 
extended to include the electroweak Hamiltonian in the PGB sector and 
to NNLO (as reviewed, e.g., by Bijnens 37 ). Many quantities are relevant 
for lattice simulations — I give a list below in sec. 4.5.1 when discussing 
PQXPT. Here I only discuss what has been learned about the important 
question of convergence of XPT. 

I quote one example 38 , obtained from a fit of the N — 3 NNLO formulae 
to a number of experimental inputs, but with the NNLO LECs estimated 
approximately using "single resonance saturation" . One of the input quan- 
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tities is Jk/ fn, an d this turns out to have a chiral expansion 37 

fx/U + £^£9 + O051 , (35) 

LO NLO NNLO 

showing reasonable convergence. The convergence is less good, however, 
for the PGB masses. 

The naive conclusion from this and similar results is that NNLO terms 
in XPT are needed for good accuracy. This is, I think, correct if one does a 
global fit to several quantities using SU(3) XPT. One might, in practice, be 
able to get away with including only the analytic terms at NNLO (whose 
form is easy to determine) rather than the full two-loop expression. This 
is the approach used in the MILC analysis 3 . This amounts to mocking up 
the two-loop contributions by changing the NLO and NNLO LECs. While 
this makes the results for these LECs less reliable, I expect it to impact 
the extrapolated results for physical quantities only at the level of NNNLO 
corrections. 11 Clearly, though, a full NNLO fit would be preferable. 

Another approach which reduces the impact of NNLO terms is to use 
SU{2) XPT alone, treating m s as heavy. After all, the actual extrapolation 
being done in present simulations is for the light quarks alone, with m s fixed 
near its physical value. In this approach the kaon and eta are treated as 
heavy particles, and one makes no assumption about the convergence of the 
expansion in m s . The idea is that this removes the dominant contribution 
to the corrections in (35) and sums them to all orders. In practice, this 
approach has been used primarily in the baryon sector. 



2.4.4. Extension to "heavy" particles 

I will not describe XPT technology for including heavy particles here, but I 
do want to mention the form of the results. "Heavy" means Whad > A x , and 



"This is based on the following argument. The dominant NNLO terms are those involving 
iBj , either explicitly or through factors of mx or . These are of size (mpc / A x ) 4 ps 0.03 
relative to LO terms, consistent with the result in eq. (35). (This is to be compared to 
purely light quark NNLO contributions — (m,r/A x ) 4 ps 0.0002 — and mixed light-strange 
contributions — m^m^/A 4 . ps 0.003.) The m a terms can involve logarithms of mpc or 
m^, but not m-n, since they cannot be singular when m u = —* 0. It follows that the 
dominant NNLO logarithms arc being evaluated far from the meson masses where they 
are non-analytic (mx,m v = 0), and thus can be well represented by analytic terms. 
This will be especially true if NNNLO LECs are included, as in some MILC fits. The 
sublcading NNLO logarithms involving m n will be much less well represented by analytic 
terms, but these are numerically smaller than the NNNLO contributions proportional to 
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the approach is to expand in 1/mhad so that at LO the hadron is a static 
source for PGBs. In this way one can include the dominant long-distance 
physics which gives rise to curvature at small light-quark mass. 

The form of the resulting chiral expansion depends on the quantity 
considered. For heavy-light meson decay constants it is similar to that for 
PGB properties, e.g. 

F B ~ F Bfi {l + my + m% ln(roir) + • • • ) (36) 

analytic chiral log 

One new feature is that the non-analytic terms are not predicted in terms 
of the LO LECs, but involve an additional coefficient, Q^bb* ■ 

For baryons and vector meson masses the expansion differs further, in- 
volving odd powers of mpGB° 

M H ~M + ml +g wH H>ml+ m*ln(m n ) + m% + . . . (37) 

V v ' S v ' 

analytic leading loop sublcading loop 

This means that the expansion is in powers of m n /A x (c.f. (m 7r /A x ) 2 
for PGBs and heavy-light mesons), so that the convergence is generically 
poorer. Thus it is even more important to use light quark masses when 
studying baryon properties. 13 

3. Incorporating discretization errors into XPT 

In this lecture I describe how for Wilson and twisted-mass fermions one can 
incorporate discretization errors into XPT, and what one learns by doing so. 
The method is general, and has been applied also to staggered fermions 8 ' 9 , 
and to mixed-action theories 39 . See also the review by Bar 40 . 

3.1. Why incorporate discretization errors? 

At first sight, it may seem strange to incorporate the effects of the ultraviolet 
(UV) cut-off of the underlying theory into the EFT describing its infrared 
(IR) behavior. The key point is that the UV effects break the chiral sym- 
metry which determines the IR behavior. One way of saying this is that 
discretization errors lead to a non-trivial potential in the vacuum manifold 



° There is a similar "m^" contribution to heavy-light meson masses but there the leading 
term is Mh ea vy— light ^> Aqcd, so the correction is less important. 

p For vector mesons, and unstable baryons, the chiral expansion is yet more complicated 
because of the opening of the decay channel as the quark mass is reduced. 
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which is otherwise flat due to the symmetry. As we will see, symmetry 
breaking due to quark masses and discretization errors have comparable 
effects on the potential if ro/A w (aA) 2 , with A a scale of O(Aqcd)- The 
appropriate value of A depends on the action, and it is quite possible that 
this condition is satisfied even for relatively fine lattices and light quarks. 
For example, if A = 500 MeV and a -1 = 2 GeV, then it is satisfied when 
m = 30 MeV. Thus it is imperative to study the impact of discretization 
errors. q 

One question that often arises in 
the present context is whether one 
should first extrapolate a — > and 
then use continuum XPT, or do com- 
bined extrapolation in a — > and 
m — > m p h ys - The possibilities are il- 
lustrated to the right. 

An apparent advantage of the first approach is that one does not have to 
rely on the validity of XPT to do the continuum extrapolation; one simply 
uses a standard polynomial ansatz. There are, however, several reasons to 
use the second, "combined" , approach if XPT formulae are available: 

• It incorporates relations between discretization errors in different 
quantities that follow from the specific way in which chiral symme- 
try is broken. 

• It accounts for non-analyticities in a which arise because of pion 
loops (e.g. for staggered fermions one has, schematically, m 2 ~ 
m q [l + (m q + a 2 ) ln(m q + a 2 ) + . . .]). These might well be missed 
in a simple polynomial continuum extrapolation. The "a 2 " in the 
chiral logs reduce the curvature, as clearly observed in the MILC 
results 3 . It should be kept in mind, however, that "a 2 " always 
means "up to logs" , so not all non-analyticities are included. 

• It accounts for changes in orientation of the condensate, which can 
be rapid with twisted- mass fermions if m ~ a 2 , as discussed below. 



q As an aside, I note that if one uses lattice fermions with an exact on-shell chiral sym- 
metry (overlap, perfect or Domain-wall fermions with N$ — > oo) then the considerations 
of this lecture become almost trivial. These fermions are described by continuum XPT, 
but with LECs that depend on a 2 and must be extrapolated to the continuum limit. The 
only exception is that there are additional terms induced by the breaking of Euclidean 
symmetry, but these are of very high order in the meson sector, as discussed below. 
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3.2. General strategy 

One proceeds in two steps 41 . First, following Symanzik 20 , determine the 
continuum EFT describing the interactions of quarks and gluons with 
\p\ <C l/a. Discretization errors enter with explicit factors of a, and are 
controlled by the symmetries (or lack thereof) of the underlying lattice the- 
ory. Second, use standard techniques to develop XPT for the Symanzik 
EFT. Since the latter is a continuum theory, this is no different conceptu- 
ally from determining the effect of beyond-the-standard-model physics on 
the IR properties of QCD. The two steps are illustrated in Fig. 2 above. 

3.3. Application to Wilson & twisted mass fermions 

Twisted mass lattice QCD (tmLQCD) 42 has received a lot of recent atten- 
tion because of its improved algorithmic properties and because, at maxi- 
mal twist, physical quantities (including matrix elements) are automatically 
0(a) improved 43 . Here it also serves as an excellent example, particularly 
as it contains Wilson fermions as a subset/ 

Twisting the mass in continuum QCD simply means doing an SU(3)l x 
SU(3)r rotation. The standard diagonal mass Mo (which is hermitian 
assuming real m q ) is rotated into M = UlMqU r which is not hermitian. 
The example I consider in detail has two degenerate flavors and 

M = m q e" 3U = m q (cosLu + i suiujts) = m + i/iT3 , (38) 

resulting in a Lagrangian mass term containing a 75X3 part: 

Q L MQ R + Q R M^Q L = Q(m + i/x 75 r 3 )Q . (39) 

Note that m q is the physical quark mass, with m and [i respectively the 
untwisted and twisted components. 

The "geometry" of the parameters is 



shown to the right. Although it naively 
appears that parity and flavor are broken, 
we know this is not the case since physics 
is unchanged by the chiral rotation. Thus 
uj is a redundant parameter. Usually, we 
keep the symmetries manifest by working 
at u) = 0, but it is important to know that 




we do not need to do so. The continuum 



r For more extensive discussion of tmLQCD and its applications see the lectures by Sint, 
and recent reviews 44,45 . 



February 1, 2008 5:29 Proceedings Trim Size: 9in x 6in 



narav3 



32 

%PT analysis described above goes through for any u>, as long as we expand 
about the rotated vacuum. 

The situation is quite different after discretization. The lattice action is 

SWue + a 4 ^i D w^i + \ L Miln,R + \rM^, l , (40) 

X 

with M the twisted mass, and the subscript "Z" indicating "lattice". Here 
Dw is Wilson's doubler-free derivative, 

p^d w = Ij2^(K+v»)-1 E( v w ( 41 ) 

(V and V* are forward and backward derivatives, respectively). Since the 
second term in D w (the "Wilson term", in which I have set the Wilson 
parameter r = 1) breaks chiral symmetry, one cannot rotate away the twist 
in the mass. The theories with mass term M and UlM U r are different 
on the lattice. In fact, the full fermion matrix Dw + MPr + M^Pl has 
positive determinant (and is thus useful in practice) only for special M. 
One such choice is two flavors with the twisted mass (38), for which the 
lattice action is 

SVmLQCD = S^giuc + « 4 ^^ (D w + m + n 5 r 3 ^o) 4>i ■ (42) 

x 

Here mo and /zo are, respectively, the bare untwisted and twisted mass (in 
lattice units). For the remainder of this lecture I will focus entirely on this 
theory. 

3.4. Determining the local effective Lagrangian 

Symanzik 20 showed how to study the approach of the lattice theory to its 
continuum limit. The first step is to understand this limit itself. The lat- 
tice provides a legitimate regularization of QCD (one that is awkward from 
a perturbative point of view, but has the great advantage of being non- 
perturbative), and so one obtains continuum QCD (in this case tmQCD) 
as the cut-off 1/a is sent to infinity. This has been established to all or- 
ders in perturbation theory 46 and it assumed to hold non-perturbatively. 
One must appropriately tune the ("relevant") lattice bare parameters to 
reach the continuum limit. In particular, since the Wilson term mixes with 
the identity operator, mo is additively renormalized, and must be tuned to 
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m c (a) < 0, while the twisted mass has nothing to mix with and is multi- 
plicatively renormalized. s The resulting continuum theory is 

AmQCD = £gluc + 1p(P + m + i"f5T3fJ.)lp , (43) 

with £ g iue the usual continuum gluon action, and with the continuum field 
and physical masses being 

if) = a~ 3 ^ 2 Z ipi , m = Zg 1 (mo — m c )/a , and fi = Z~ p 1 [io/a. (44) 

Here Z, Z$ and Zp are renormalization factors relating quantities in the 
lattice regularization to those in the chosen continuum scheme. The corre- 
sponding geometry is illustrated below. 



m c /a m 7 



H /a 



m^a 



Corrections to the continuum limit are suppressed by inverse powers of 
the cut-off, i.e. by positive powers of a. Symanzik showed how these can 
collected into a local effective Lagrangian 

£ S ym = AmQCD + a£ (5) + a 2 + . . . , (45) 

where AmQCD is the desired continuum Lagrangian (note the absence of 
the "L" in the subscript), and £' 5 ' and £^ contain terms of dimension 
5 and 6, respectively.' The effective theory must be regularized, either 
by a standard continuum regulator such as dimensional regularization, or 
possibly with a finer lattice having a' -C a. We will not actually use £sym 
for concrete calculations so do not need to be more specific. A key feature 
of £sym is that all factors of a are explicit — the effective theory does not 



s One must also tune the bare coupling go — > in the usual way. 

'The number of effective Lagrangians in these lectures is approaching a confusing level. 
Note that I always use the subscript \ for the chiral Lagrangian, so that the six-derivative 
contribution £^ 6 ' can be distinguished from £' 6 ' in eq. (45). 
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"know" about the lattice spacing in any other way. Note, however, that 
"a" includes logarithms, of the form ~ a[l + g(a) 2 In a + . . . ] . I return to 
this point below. 

The content of eq. (45) is that all discretization errors in all correlation 
functions can be reproduced by a set of local insertions. This is established 
by a procedure akin to renormalization (in which one determines divergent 
parts of graphs by doing a Taylor expansion in the external momenta, or a 
lattice variant of this procedure 46 , and then subtracts them), except that 
one subtracts more terms in the Taylor expansion ( "over-subtraction" ) , 
thus including those proportional to powers of a. As with renormalization, 
the consistency of this procedure requires that one include all terms in 
of the appropriate dimension which are invariant under the symmetries of 
the theory — here, of tmLQCD. These terms will have coefficients such that 
reflection positivity is satisfied, since they arise from a theory in which it 
is satisfied." This procedure also works if one includes sources for external 
operators, which should be treated using the spurion trick. The procedure 
has been demonstrated to all orders in perturbation theory, and is assumed 
to work non-perturbatively. 

The preceding discussion is nothing other than (a sketch of) a deriva- 
tion of an EFT. The usual EFT words — "separation of scales" — were not 
mentioned but were implicit. £sym is only useful if p <C 1/a, for otherwise 
successive terms in the expansion, which give contributions of relative size 
ap, are not suppressed. Thus £sym is an EFT for quarks and gluons with 
energies far below the cut-off scale. The set-up is the same as when con- 
sidering the impact of new short-distance physics on QCD, except that the 
new physics here violates rotation and translation symmetries. It indicates 
how one can derive an EFT in a Euclidean context, at least order-by-order 
in perturbation theory. One does not need to rely on the S-matrix argu- 
ment of Weinberg. This is important because the underlying lattice theory 
is discretized in Euclidean space. 

Let me illustrate these general words with a simple example. Consider 
the quark-gluon vertex shown in Fig. 7. In EFT language, the counterterms 
£(5,6,... ) are k e determined by matching correlation functions with those 
of the lattice theory. At tree-level, the 0(a) terms in the lattice vertex can 
be matched by adjusting the coefficients of operators in £^ 5 ' such as ipD 2 i(; 



"This is true for the action used in the text. Reflection positivity is violated with 
improved Wilson fermions or improved gauge actions, but it is expected that this docs 
not effect the long distance physics which is being captured by £sym- 
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as long as \p\ -C 1/a. At one-loop, for \q\ «l/a, the integrands match by- 
construction, giving a logarithmic divergence. For larger \q\, however, the 
matching fails, leading to a finite difference in the results (finite since both 
theories are regulated). To match one must add an 0{g 2 ) contribution to 
the coefficients of the operators in . One loop matching is schematically: 

apg 2 {\n[pa] + n at ) = apg 2 (\n[pa!] + r EFT ) + apg 2 c^ T (46) 
=> Ceft = r iat - r EFT + ln[a/a'] , (47) 

where Hat, eft are the finite parts of the loop diagram, and 1 have used 
a lattice regularization of the EFT with spacing a' . We can now see the 
generic form of the a dependence of the coefficients in (whose one- 
loop contribution is given here by g 2 c^ T ). There is explicit logarithmic 
dependence, and an implicit logarithmic dependence through g, which is 
evaluated in the lattice calculation at a scale ~ a. 
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Figure 7. Matching lattice QCD with the Symanzik EFT. Dashed lines are gluons, solid 
lines quarks, p <C 1/a an external momentum, q a loop momentum. 



As an aside, I note that, having determined the form of in the 
EFT, one can add corresponding terms to the lattice theory (i.e. terms 
having operators in as their classical continuum limit), and then adjust 
the coefficients of the lattice terms to set those in £( 5 ) to zero. This is 
the "improvement program" at O(a) 20 , and it has been implemented non- 
perturbatively 19,47 . In the "new physics" context this would be called "fine 
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tuning", with negative connotations, but here we can again turn all the 
knobs at our disposal to improve extrapolations. The program can be 
extended, in principle, to any order, but has in practice not been extended 
to £( 6 K Since will play a key role in the following, this means that the 
considerations below are essentially unaffected by improvement. 

3.5. Symanzik effective action for tmLQCD 

We are now ready to determine the operators in and . These must 
satisfy the symmetries of tmLQCD, eq. (42), and be reflection positive. 
The symmetries are gauge invariance, lattice rotations and translations, 
charge conjugation and fermion number, but not flavor SU (2) nor parity 
(for generic to). Only the [7(1) flavor subgroup generated by T3, and com- 
binations of parity with a discrete flavor rotations survive: 

Vp 2 : ipi{x) -> 70(^1,2)^^) j M x ) ^l(xp){-iTi,2ho , (48) 

Also useful is V: parity combined with [/j,q — ► — fi ]. 

The continuum Lagrangian consistent with these symmetries is tmQCD, 
eq. (43). To determine one simply enumerates all allowed operators, 
generalizing the work done for Wilson fermions 19 . The result is 48 

£ (5) = biipia^F^i/j + b 2 tp(p + m + i~f 5 T 3 n) 2 ip 

+bsmip(p + m + 1751% fi)i/> + bimL g \ nc + b^m 2 ^ 
+b 6 fiil)[(p + m + H 5 T3/i),i7 5 T 3 ] + V> + b 7 ^ 2 ipil> , (49) 

where I use continuum masses m, /1 rather than bare masses. The coeffi- 
cients bi (the analog of the LECs of XPT) are real (from reflection positiv- 
ity), and depend on g 2 (a) and In a. Among them 65 7 are "new" compared 
to Wilson case. Many terms have been forbidden by lattice symmetries: P 
forbids m/z^ and m 2 ipi^5T3ip, and it requires V' r 375V ; to come with factor 
fiQ oc /jl, and the twisted Pauli term xpa^F^Tsip to have factor of \i (so 
that it appears in £( 6 )); flavor [7(1) forbids ^1,2^; and V 1 / forbids ^75^, 
F^F^v and iprzip. 

£( 5 ) looks rather forbidding, with 7 unknown coefficients, but this 
proliferation is misleading, for two reasons. First, we will be doing a 
joint continuum-chiral expansion, and working in the "generic small mass" 
(GSM) regime in which m <~ /i <~ aAq CD . Thus each factor of m or fi 
counts as an additional power of a. We will work at NLO in this power 
counting. Since ipTftip and C g i uc map into LO operators in XPT, and 
comes with an overall factor of a, any further factors of m or fi make the 
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operator of next-to-next-to- leading order (NNLO). This allows us to drop 
all except the 61 term, and the iptf) 2 ip parts of the 62 and 63 terms. 1 Second, 
we will be mapping £sym into XPT, at which point all that matters is the 
chiral transformation properties of the operators. Now the Pauli (61) term 
and tp^) 2 tp transform the same way, so, since the coefficients in the mapping 
to XPT arc unknown, we can drop the latter operator. The outcome is 

^nlo = biipicr^F^ip , (50) 

which is unchanged from the result for (untwisted) Wilson fcrmions. 
Moving onto there are now gluonic terms 49 

4iuo ~ ^(D p F pa D p F prT ) + TriD^DpFpr) 

+ TtiP^D^) + (m 2 ,^)Ti(F^F^) , (51) 

Lorcntz violating 

where I use a schematic notation without coefficients, and fermionic terms 
(obtained by generalizing the analysis for Wilson fermions 50,51 ) 

Cf ~ +^D„pD^ + ... 

Euclidean non-invariant 

+ mipp 2 ^ + fJ,ipp 2 ij5T 3 ip + mipia^F^ip + /J.ipa tlv F lJiI ,'y 5 T 3 ip 

N y / ' / 

0(a 2 m, a 2 fl) so NNLO 0(a 2 m, a 2 p.) so NNLO 

+ (m 2 , fi 2 )ipy)->p + m^ippi"f 5 T3-)p + (m 3 , to/x 2 )^ + (m 3 \ ^m 2 )ij 5 T3ip 

S v ' S v ' 

0(a 2 m 2 ), etc. so NNNLO 0(a 2 m 3 ), etc. so NNNNLO 

+ {^f + (^7mV0 2 + ■ ■ ■ , (52) 

where the first ellipsis indicates other Euclidean invariant terms with three 
derivatives and the second other four-fermion operators. As can be seen, 
in the GSM regime, most of the fermionic operators in are of at least 
NNLO. In particular, no flavor-parity breaking terms appear, since they 
require a factor a /j,. The net result is that the part of of NLO in the 
GSM regime is the same as that for untwisted Wilson fermions (with the 



'in the usual discussion of on-shcll Symanzik improvement, one drops terms vanishing 
by the LO equations of motion, which only contribute to contact terms. This is not 
necessary here, but explains the basis used in eq. (49). 
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(53) 



„ ' 

Euclidean non-invariant 



Note that the only symmetry broken by C^l that is not already broken by 
£nlo i s Euclidean invariance. In fact, we will see that in the PGB sector 
the Euclidean non-invariant terms pick up an additional factor of p 2 and 
(given the overall a 2 ) are of NNLO. 

This takes care of the action, but what about currents and densities? 
The matching of these between the lattice theory and the EFT can be 
worked out using symmetries 20 , and I quote only the relevant results 19 ' 54 : 



Here I work at NLO in the GSM regime, so am ~ a/i terms are dropped. I 
also drop the mixing of S° with the identity operator, as it does not con- 
tribute to connected matrix elements. The coefficients cy,A and gs depend 
on g(a) and on ln(a), just like the bi above. The content of these equations 
is that the on-shell matrix elements of the operators shown, evaluated in the 
EFT, will reproduce those of the lattice currents and densities, including 
the leading discretization error. These forms apply for any choice of lattice 
currents and densities (e.g. ultra-local or smeared) as long as they have 
been multiplied by appropriate Z-factors so as to be correctly normalized. 
The numerical values of the coefficients 5y etc. will, of course, depend on 
the form of the lattice operators, and on the lattice action. In particular, if 
the action and operators have been 0{a) improved, then these coefficients 
will vanish. Note that the density P b is automatically improved. 

When we map the operators in (54-56) into XPT we are free to do this 
for the 0(1) and 0(a) parts separately and then combine at the end. It is 
straightforward to see that the dy and gs terms map into operators which 
are of NNLO and can be dropped. For the former, the argument is given 
by Wu and I 55 , and follows because of the need to have three derivatives in 
order to match the quark-level operator. For the gs term the argument is 
even more simple: the matching of chiral singlet T^F^F^) gives the LO 




y ii 



s° 



i>l^T b ^ + ac A d~$ lb T b iP 
W> + agsTriF^F^) , 



P b = ^T b i> 



(54) 
(55) 
(56) 
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chiral kinetic term Tr^Ec^X^). Thus the g$ term is of size ap 2 , and so 
of NNLO compared to Ifip, which maps to Tr(S + £*) ~ 0(1). 

I conclude that, at NLO in the GSM regime, the currents and densities 
in the Symanzik EFT have the same form as in the continuum, aside from 
the ca term. Thus, as long as one treats the ca term separately, one can 
include the currents and densities using sources just as in continuum QCD, 
and the resulting theory will have a local chiral symmetry. 11 

3.6. Mapping the Symanzik action into XPT 

I now turn to the second step of the procedure — taking the Symanzik EFT 
and determining the chiral EFT which describes it at long distances. This 
was done for continuum Lagrangian, AmQCD, in sec. 2 — a twisted mass was 

(2) 

already included by the generality of the formalism. The result is C x of 
eq. (28) at LO and of cq. (31) at NLO. The task here is to include the 
effects of £( 5 ' 6 \ as well as the ca term in eq. (55). To do so systematically 
requires a power counting scheme, to which I now turn. 

3.6.1. Power counting and terminology 

As anticipated above, discretization errors introduce a new parameter into 
the power counting in XPT. In addition to the usual chiral expansion in 
powers of p 2 /A 2 , ~ m^ G /A^ ~ "ig/AQCD, we must include oAqcd- Note 
that Aqcd is the only scale available to balance dimensions when we map 
quark-level operators in £sym into XPT. I stress that, once one has the 
Symanzik EFT in hand, m and a are on a similar footing — both are simply 
small parameters in a continuum Lagrangian. 

How we should weight discretization errors relative to mass corrections? 
The numerical comparison is shown in Fig. 8. I have been conservative 
by having "present simulations" range down to m s /10, which is yet to 
be achieved with Wilson-like fermions. 111 I conclude from the figure that 



n There is a one subtlety here. After adding in the O(l) parts of the currents and 
densities, the local invariance is only true for the continuum part of the Symanzik action, 
■CtmQCD- It can be extended to £^LO' however, by allowing the corresponding spurion, 
called A below, to transform like \ under the local chiral symmetry. It can also be 
extended to -C^LO' but this is not actually necessary because the resulting contributions 
to the currents are of NNLO. I should note that there is some disagreement on the 
validity of this approach for mapping currents and densities 56 ' 57 , which is why I have 
given here a more detailed discussion than is present in the literature 55 . 
m Note that the comparison is not precise: the relative coefficients of mass and dis- 
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the appropriate power counting for the coming decade is o 2 Aq CD 
aAq CD , and that to disentangle quark mass dependence from discretization 
effects we need to remove errors of 0(a) and understand those of 0(a 2 ). 



present simulations 


a'A a 2 A 




aA 2 




io 


io l l 40 


o |' Jo 




m(MeV; 


(mu+md)/2 




ms/2 



Figure 8. Comparing discretization errors to mass corrections. I use a 1 = 2 GeV and 
A = 300 MeV. 



I will consider in the following two regimes (sketched below): 



I. The GSM regime, already 
introduced above, which I de- 
fine more precisely as aAq CD < 
m q <C Aqcd, so that it includes 
aAq CD <C m q (where we would 
like to be) as well as aAq CD sw m q 
(where we actually are). This is 
the regime in which we want to 
learn how to remove 0(a) errors. 




a 2 A 
GSM regime 



II. The "Aoki regime" , in which m q < a 2 Aq CD <C Aqcd (including m q <C 



1 A?. 



QCD^ 



This is where we cannot avoid discretization errors, and where 
they lead to non-trivial phase structure. Entering this regime changes the 
relative weight given to operators in XPT, but not the operators themselves. 

I begin by working in the GSM regime at NLO. This requires keeping 
terms of 0(a 2 ), and thus up to in the Symanzik expansion. The 
enumeration will turn out to suffice also for a LO study of the Aoki regime. 



cretization effects in the EFT are expected to be of O(l), but could easily be 2 — 3. 
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3.6.2. Mapping and into XPT 

^nlo transforms just like a mass term under SU(2)l x SU(2)r (recall that 
we set N — 2, although the considerations are easily generalized): 

a£ NLO ~ aipia^F^tp = ^lAia^F^n + ipnAha^F^ipA • (57) 

Here A is a spurion transforming like M, i.e. A — > UlAU^, so that £^1,0 
is invariant. At the end we set A = a. The enumeration of operators in the 
chiral Lagrangian is like that for M, yielding the new terms 52 ' 51 ' 53,55 

4 2 > A = -^tr(it£ + £ti) (58) 

= W 45 tr(D^D^)tr(A^ + £U) - W^[tv(A^ + £U)] 2 
-Wastrel! + E^M^E + SU) + W 10 tv(D^D^ + D^D^A) 
-H'M^X + X f A) - ff 3 tr(iU) , (59) 

where A = 2W$A is the analog of \ = 2Bq(s + ip), with Wo a new LEC 
at LO. We will not need to use A as a source (since we already have x)j 
so we will always be set A — > a = 2Wqcl. SU(2) simplifications can then 
reduce the number of terms. In particular (A^E + Y$ A) is proportional to 
the identity, allowing single trace terms involving this combination to be 
rewritten with two traces. These simplifications have been used in (59). 

Note that, following the discussion at the end of sec. 3.5, 1 have included 
sources for currents and densities and enforced local chiral invariance by 
using covariant derivatives [defined as in eq. (27)]. This incorporates 
all discretization effects in lattice matrix elements except those due to the 
ca term in eq. (55), which I will treat separately below. 

^nlo contains three types of terms. First, those that are invariant 
under Euclidean and chiral symmetries [the gluonic terms on the first line 
of eq. (53) and some of the four-fermion operators] . These match into XPT 
as follows: 8 

a 2 Ti{D^F piJ D^F piJ ) + ■■■+ a^D^D^ip + ...—> ahr(D^D^) , 

(60) 

i.e. one obtains the leading order continuum result multiplied by a 2 . This 
leads to an 0(a 2 ) correction to the LEC /, and is present for any fermion 
discretization (including chirally invariant ones). 

Second, there are four-fermion operators which violate chiral symmetry 
(e.g. those having LR-LR structure). Their matching can be analyzed 
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using two A spurions, leading to 51 

(^V) 2 + $1^? + ...—» tr [(i f S + SU) " . (61) 



2 



This operator is already present in having been produced by two 

(5) 

insertions of C nlo . This illustrates that what is relevant for matching are 
the symmetries broken by the operators (here, chiral symmetry), and not 
their detailed form. The four-fermion operators simply change an unknown 
coefficient, Wq S , by an unknown amount. The only exception is if one uses 
a non-perturbatively 0(a) improved quark action, as discussed below, when 
Wq 8 would vanish were it not for the four-fermion operators. 

Finally, there are the terms violating Euclidean symmetry. These can 
be decomposed into Euclidean singlet and non-singlet parts. The former 
match as in eq. (60), while the latter give rise to Euclidean non-invariant 
chiral operators 8 

a 2 Tr{D^ a D^ a ) + a^D^ — a 2 tr(£££L>* S t) . (62 ) 

Since one needs four factors of to make a non-invariant operator, the 
result, when combined with the two powers of a, is an operator of NNNLO 
in XPT, two orders higher than we are working. 

We thus find that £^l adds no new operators, so the results (58) 
and (59) are complete. They are to be added to C$ [eq. (28)] and C$ 
[cq. (31)], respectively, to obtain the full LO and NLO contributions to the 
chiral Lagrangian in the GSM regime. Note that using a twisted mass had 
no impact on the analysis of this subsection, since /I^lo nave the same 
form as for untwisted Wilson fermions. 

I now return to the ca term in the axial current, eq. (55). To obtain the 
full axial current in the EFT one must separately match this term into XPT 
and add it to the result obtained by taking derivatives of the Lagrangian 
obtained above with respect to sources. It is a simple exercise to show, 
however, that the result is simply to change the coefficient W\o, since the 
operator it multiplies is exactly of the form ad^P b , at linear order in the 
sources. Thus the final form of the previous paragraph remains complete, 
albeit with somewhat changed (although still unknown) coefficients. 

There are thus five new LECs introduced by discretization errors: IV Wo 
at LO, W45, Wq8, Wg S and W\o at NLO. What do we know about their 
values? Of course this depends on the choice of fcrmion and gauge actions, 



'This becomes ten new LECs in SU(3) or PQ theories 51 . 
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so we can only make order of magnitude estimates. Perturbing in a and m 
after rotating to Minkowski space, one finds 



W HV^F^Itt) a2 
-Bo (7r|V'V'l 7r ) QCD 



A^cd • (63) 



The first relation is not an equality because there are unknown coefficients 
multiplying numerator and denominator, while the second is a dimensional 
estimate. One might be tempted to include a loop factor because the matrix 
element in the numerator requires at least one gluon loop, but this is a non- 
perturbative matrix element so such counting is inappropriate. 

The four NLO Wi are dimensionless, because the dimensions needed to 
balance powers of a are provided by the Wo residing in A. Like the Gasser- 
Leutwyler coefficients L i} they depend on the renormalization scale, \i. We 
will find that the combinations 

W = W45 - £45 , W = W 6S - 2L 6S , and W = W^ 8 - W 68 + L 68 ( 64) 

are /x independent, so the scale dependence of the Wj themselves is com- 
parable to that of the L,. Using the argument of sec. 2.3.5, we then es- 
timate \Wi\ ~ \Li\ <~ l/(47r) 2 . Another line of argument gives a similar 
estimate. In continuum XPT, tpip maps into O = / 2 2B tr(S + S f )/4. 
Thus we expect the mapping of a 2 (tp^) 2 to contain 0(1) x a 2 2 plus 
other operators. In fact, this four-fermion operator (along with others) 
maps into a 2 O 2 16Wg 8 W 2 / \f A B 2 ) . Comparing, and using eq. (63), I find 
Wg 8 ~ (1/16)(//Aqcd) 4 - It is reassuring that this estimate, albeit crude, 
agrees with the order of magnitude of that above. 

The analysis to this point has assumed that the fermion action is not 
non-perturbatively improved. Since many simulations now use such im- 
provement, it is interesting to ask how it changes the analysis. Improve- 
ment sets £}p L o to zero. Since this was the source of almost all terms linear 
in a in C^'^ above, the impact is to set = W45 = Wqs = 0. v The 

term quadratic in a, with coefficient Wq S survives, since this also comes 
from matching with C$ lo . Two coefficients of terms linear in a do not 
vanish: Wio and H' s . Why? As already noted, the former multiplies a term 
which only contributes if the source for the axial current is non-vanishing 
(the part containing the vector current cancels). It thus represents dis- 
cretization errors in the matrix elements of this current. But these matrix 



v Note that the vagaries of the notation do not allow one to set Wo = 0, for then all 
discretization effects would vanish. 
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elements require improvement additional to that of the action, namely the 
addition of the "ca term" to the axial current 19 . If this improvement has 
not been implemented, then Wio ^ 0. Note that, following the discussion 
at the end of sec. 3.5, the matrix elements of the vector current and the 
densities are automatically improved (at NLO) if the action is improved. 

The non-vanishing of even with non-perturbative improvement is 
a consequence of the improvement being on-shell. Contact terms in cor- 
relation functions arc not improved, and this translates into the HECs of 
O(a) being non-zero. This is, however, an academic point, since the only 
quantity of interest affected by is the scalar condensate 55 , and this is 
very difficult to calculate in practice on the lattice. 

3.7. Results for m q ~ aAq CD (GSM regime) 

We are now ready to reap the benefits of the work we have done setting 
up XPT for tmLQCD, which I will call tmXPT. We can already see one 
benefit of the XPT technology: although there are several unknown LECs 
describing discretization errors, no more are needed for the entire twisted 
mass plane than for the untwisted Wilson mass axis. 

In this section I discuss the results in the GSM regime, first at LO and 
then at NLO. 

3.7.1. TmXPT at LO 

The complete LO Lagrangian include discretization errors is 

4 2 gsm = ^trOD^Et) _ ^ t r(x + S + S+ X ) - ^tr(A^ + Sti * 65 ) 

It will be useful to change the notation for sources: 

X = 2B (s + ip) = 2B (m + iuT 3 ) + Sx , 5 X = 2B (Ss + iSp) . (66) 

We now make the simple but important observation that the factors of A 
can be absorbed by using the shifted variable 41 \' = \ + A: 

4 2 gsm - ^-tr(£>„ED M £t) - ^tr( X 't£ + EV) . (67) 

The result has exactly the same form as the LO continuum Lagrangian. 
The shift \ ~ * x' corresponds to an 0(a) shift in the untwisted mass, 
m —* to' = m + oWq/Bq, but leaves Ss (and, indeed, Sx) unchanged.. 
Recalling the definition of to, eq. (44), this is equivalent to a shift in the 
critical mass, Am c = —a 2 Z s W /B . This shift is not measurable, however, 
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since m c is not known a priori. It must be determined non-perturbatively 
from the simulation itself. The traditional definition is that m c is the bare 



mass at which mL — > on the Wilson axis. Since at LO C 



mz oc m 



we discover that this 



,2- 



(2) 

X,GSM 



predicts 



definition of m c automatically includes 



the 0(a 2 ) shift in m c , and chooses the untwisted quark mass to be ml . 

The upshot is that, with the standard numerical definition of m c , the 
pion and vacuum sectors are automatically 0{a) improved at LO in tmXPT 
for any twist angle. The analysis of this theory is as in the continuum and 
I sketch it quickly. The twist angle ujq is defined by VI 

(ml + «T3/i) = m q e LU)aT ' 3 , (m q real and positive) (68) 

giving the geometry shown below (note the renormalized axes). 







H /(aZp) 


m_q / |i 












m;. /la Zs) 


^/(a Zs) 



The potential is minimized when £ is aligned with yf: (£) = exp(iwoT3). 
It is conventional to expand S about the condensate in a symmetric way, 

E=(oSpj,^, Co = exp(iLL} T 3 /2) , Spft = exp(i7? • f/f) , (69) 

since this corresponds diagonalizing the mass matrix with an axial trans- 
formation^ 11 The resulting theory is 



£ x ,lo = —triD^phD^l^ - — tr | 



ph 



-"ph. 



h.c 



} ,(70) 



where 



2Bom q and "h.c." is hermitian conjugate. The masses and 



interactions of the pions are manifestly independent of loq, since the mass 



VI This is another unfortunate notation 55 . This is a renormalized twist angle with a good 
continuum limit, because of the Z-factors in the definitions of m' and fi. The subscript 
on ojo does not indicate a bare quantity. 

VII One could equally well use, say, a LH transformation to diagonalize M, and a cor- 
respondingly asymmetric form for E. The difference is a U(l) vector transformation 
that has no effect on the physics, although there will be extra phases associated with 
operators 
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term has been "untwisted" . At the same time, however, sources have been 
twisted: (<?x)^ = ^o(Sx)^o and 

D^ ph = d^ ph - ilfH ph + iV ph rf , with If = tXto , rf = £ r^ . 

(71) 

The operators used to determine physical matrix elements (obtained by 
functional derivatives with respect to the physical sources) are thus related 
to those in the lattice theory (obtained using the original sources) by an in- 
dependent twist. This reproduces the twisting one finds at the quark level: 
fields in the original (or "twisted") basis [that with M = m q exp(ioj T3)] are 
related to those in the "physical basis" [M = m q ] by an axial transformation 
[4> P h = exp(iwoT375/2)i/)], so that operators are also transformed 42 , e.g. 

Uphl^lsdph = coswoiu^^d) - i sinwoC^d) . (72) 

Thus, at maximal twist (ujq = tt/2) the charged pion should be created 
with the lattice vector current. 



3.7.2. tmXPT at NLO 

The NLO Lagrangian, rewritten in terms of x', and dropping HECs, is 
- f ( 4 ) 4- r (4) 

= -L 13 tv(D„ED^) 2 - L 2 tr{D^D v tf)tr(D^D„tf) 
+L 4 MD^D^)tr(x'^ + £ f x') - L 68 [tr(x' f S + SV )] 2 
+mr(L> M S t D AI S)tr(i t S + sU) - Wtv( X '^ + S t x')tr(i t S + EU) 
-W [tr(itS + EU)] 2 + W 10 tv(D^D^ + D^D^A) . (73) 

Here I used SU(2) relations to combine terms, cq. (31), with L13 = 
L\ + L3/2, L45 = L 4 + £5/2 and L 6 8 = L§ + Lg/2. I have also used 
the shifted VF's defined in eq. (64). The latter, as noted above, turn out 
to be independent of the renormalization scale. If we are using a non- 
perturbatively O(a) improved action then W — W — 0, and if we also 
improve the axial current then Wio = 0. 

Subsequent results are simplified by the observation that W\q is 
redundant 55 . A change of variables, 5T, = (2Wio/f 2 ) (£it£-i), which 

keeps £ e SU(3) up to NNLO corrections, cancels the W10 term while 
shifting the other LECs: W -» W + W w /A and W -> W + W w /2. Be- 
cause of this, I set Wio = henceforth. Note that if one uses a non- 
perturbatively 0{a) improved action but an unimproved axial current, then 
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after this change of variables W and W no longer vanish, but are related 
by 2W — W = 0. 

I now present a sampling of NLO results 53 ' 58 ' 55 ' 56 . These require as a 
first step the determination of the condensate about which to expand, and 
this is realigned by NLO terms. In the continuum there is no realignment, 
because the Lqs term is already extremized by the LO condensate, (E) = 
exp(zwoT3) oc x' ■ The "discretization terms" (those proportional to W and 
W) do, however, lead to a realignment, because they "pull" the condensate 
either towards or away from the identity direction. The result is: 

(E)=e^)-, e = _I^|^(V + 2Trcosc,oi) (74) 

Note that e vanishes on the Wilson axis (ui = 0, 7r), and that the W 
contribution is enhanced if fn q <C a (in which case one enters the Aoki 
regime, where an O(l) vacuum realignment is possible, as discussed below). 

While e is not measurable in simulations (as it is defined within XPT 
and not in terms of observables), it illustrates the typical magnitude of NLO 
effects. Given that we expect \W\, \W\, \W'\ ~ 1/(4tt) 2 and W Q ~ A^ CD , it 
follows that e <~ oAqcd- To implement these expectations, it is useful to 
use rescaled variables 59 

WaW . . IQaW . , 16a 2 W 2a4 

° w = p ~ aA QCD, = ^ ~ aA QC D, w = — -p — ~ a A QCD . 

(75) 

The general form of NLO results for observables is illustrated by 

1 16 

2 " + ? 

+m q cosuiq(25w ~ + 2(cosluq) 2 w' . (76) 

The first line is the continuum NLO result [the SU(2) version of eq. (32), 
with chiral logs defined in (33)], while the second shows the impact of dis- 
cretization. Recall that m q is defined to be positive. The scale dependence 
of the chiral log is absorbed by 2L 68 — L 45 , leaving 2<5w — scale invariant. 
Chiral logs do not contain discretization corrections at this order because 
the LO discretization errors can be absorbed into \' ■ 

The result (76) shows the different possibilities for removing 0(a) errors. 

• Non-perturbatively O(a) improve the quark action, in which case 
2Sw — = and the 0(a) term vanishes. 

• Use "mass averaging" 43 in which one averages over u>o and ujo + 
7r at fixed m q . This flips the sign of both ml and /i, and thus 



1 + 77^ + -^mr q (2L 68 - L 45 ) 



February 1, 2008 5:29 Proceedings Trim Size: 9in x 6in 



narav3 



-18 



of cos Wo, and cancels the 0(a) term. It has been shown to lead 
to 0(a) improvement for all physical quantities, including matrix 
elements 43 . 

• Work at maximal twist, ui = ±ir/2 (both choices of sign are equiva- 
lent, and I use the positive sign henceforth). This removes the 0(a) 
term (as it must since it is a special case of mass averaging where 
the average is automatic 43 ), and, in this case though not in general, 
also the 0(a 2 ) term. 

I note two further features of the result (76). The 0(a) errors arc 
determined throughout the twisted-mass plane by the combination 26w — 
8yf. In particular, on the Wilson axis, this term predicts an asymmetry 
in the slopes on the two sides of m c . This has recently been observed 
numerically, as part of the initial studies of the properties of tmLQCD. I 
show an example of the results in Fig. 9. Defining the asymmetry in a 
quantity Q as 59 

^ (n , = Q(to» - (-)?>Q(-to» 
AbWj -QK, Ai ) + (-) P Q(-m', M )' (U) 

(with p a parity which is +1 for most quantities), one finds 

AS(to 2 ± ) = ( m '/m q )(2S w -S w ) ^ (25 w - S~) . (78) 

The observed asymmetry in fig. 9 is ~ 0.3, consistent with the expected 
size (a- 1 w 1 GeV and A QCD « 0.3 GeV). 

The second feature of (76) that I want to emphasize is that the w' term 
(proportional to o 2 ) gives an additive correction to to 2 . This is indicative 
of the breaking of chiral symmetry. It can be of either sign. If w' > then 
one expects a minimum pion mass, as observed in fig. 9, while if w' < 
the pion mass-squared can become negative, indicating an instability. This 
leads to the well-known Aoki phase 61 . vm 

A graphic demonstration of the impact of discretization errors is pro- 
vided by the contours of to 2 in the twisted mass plane. Examples are shown 
later in fig. 14. At LO the contours are circles (as in the continuum), in- 
dicating that oj is redundant, but they are significantly deformed by the 
discretization errors that enter at NLO. 

Discretization errors linear in a can be separated from the continuum 
parts of the predictions by using the asymmetries defined above. These are 



VIII Sincc these phenomena occur when rn q ~ & 2 Aq CD , i-e- in the Aoki regime, there 
are corrections to (76), which will be discussed below. 
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Figure 9. Unquenched results for {am^) 2 as a function of (2k) 1 = mo + 4 for fi = 
and with a' 1 0.2 fm 60 . Straight lines are to guide the eye. 



pure discretization effects, while the corresponding symmetries are 0(a) 
improved because symmetrization is the essentially the same as mass aver- 
aging. The asymmetries for quantities accessible with simulations are 55 ' 59 

2AS«0|P*| 7 r ± » = AS((w\S°\w)) = AS(m* ± ) = (m'/m q )(26 w -6 w ) (79) 
AS(U) = (m'/m q )S w /2 , AS(m PC Ac) - (m q /m')S w . (80) 

What is predicted here are the form of the mass dependence, and the re- 
lations between asymmetries in different quantities. The PCAC mass is 
defined below in eq. (89). Note that the asymmetries on the first line van- 
ish for a non-perturbatively improved action, while those on the second do 
not unless the axial current is also non-perturbatively improved. 

TmLQCD explicitly breaks flavor and parity symmetries, and tmXPT 
can be used to see how such breaking effects physical quantities. Of partic- 
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ular interest is flavor breaking, e.g. the splitting of the pion multiplet or the 
A-baryon multiplet. The good news is that such breaking is automatically 
of 0{a 2 ) for any twist angle. This is because ^75X3 structure of the 0(a) 
flavor breaking implies that it contributes at linear order only to parity- 
violating matrix elements. To obtain a contribution to masses one needs 
two parity-breaking insertions (to bring one back to the original parity). 
An example which can be studied using tmXPT at NLO is 

„2 

mlo - ml ± = -2w / (sino;o) 2 = -2w' /2 , 2 ■ (81) 

(Recall that w' <~ a 2 .) The splitting must vanish on the Wilson axis (as 
there is then no flavor breaking), and is necessarily even in /j 2 from the 
argument above. Not surprisingly, the splitting is maximized for ojq = ir/2. 
Similar results hold for A baryons 62 . Which pion is heavier depends on the 
sign of w', which, as already noted, also determines the nature of the phase 
structure in the Aoki regime 58 . 

Flavor-parity breaking of 0(a) does occur for unphysical parity- 
violating matrix elements. This is true for any non-zero twist, in- 
cluding maximal twist — the argument for automatic 0(a) improvement 43 
does not hold for such quantities. Results for the following parity- 
flavor violating form factors are available in tmXPT 55 : {iTb\A b ph , Pp^irs), 
(^b\A 3 phifl ,P^ h \TT b ) and (7r 3 |A3^,P p \|7r 3 ), where b = 1,2. Here is one ex- 
ample: 



{^b(p2)\PphWb(pi)} = iB sin w 



5 



(25 w - 8 w )q 2 



W 2(q* + m% 3 ) 



iB a sin(2tJo)w' 
H 5— o ' 



where q = p 1 — p 2 is the momentum transfer. These quantities provide 
an interesting window into the workings of tmLQCD, and are predicted 
once one has determined the LECs, but are difficult to study numerically 
because they involve quark-disconnected contractions. 

I am aware of one detailed comparison of the results from simulations 
of dynamical tmLQCD with tmXPT formulae 63 . (There is also a detailed 
fit to quenched data 64 which I discuss below.) The simulations are for rela- 
tively coarse lattices (a w 0.13 and 0.18 fm), yet find reasonable agreement 
with the NLO forms sketched above, with the continuum LECs consistent 
with continuum results, and the magnitudes of the "lattice" LECs con- 
sistent with expectations (although poorly determined). The asymmetries 
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discussed above are clearly present as a function of to'. The authors do 
their fits versus the PCAC mass, however, and for these the asymmetries 
are reduced, leading to the poor determination of the lattice LECs. 

These fits give one confidence that tmXPT is a useful tool, and is likely 
to become more so as lattice spacings and quark masses are reduced. Thus 
it can aid extrapolations, and, perhaps more importantly, guide the inves- 
tigation of the properties of tmLQCD, in particular its phase diagram and 
the issue of defining maximal twist. I return to these issues shortly. 

Other simulations have studied the pion mass splitting 65 ' 66 ' . This 
involves both quark-connected and disconnected contributions, with the 
latter hard to calculate accurately. Nevertheless, this is a key quantity 
to determine as it both sets the scale for isospin breaking and the size 
and nature of the phase structure. TmLQCD has some similarity to stag- 
gered fermions (although not the need to use rooting) because a desired 
continuum symmetry is broken (flavor for twisted mass fermions, taste for 
staggered), in both cases at 0(a 2 ). Fits to staggered fermions require this 
taste-breaking to be treated at leading order in "staggered" XPT (i.e. the 
equivalent of the Aoki regime here), because taste-splittings are large. Nu- 
merically, the splitting are ~ a 2 A 4 with Awl GeV. The hope for tmLQCD 
is that the flavor-breaking is smaller, and can be treated as a NLO effect 
(the GSM regime). One can also tune its size by varying the quark and 
gluon actions. Quenched results for the mass splitting (with Wilson fermion 
and gluon actions and a w 0.1 fm) find A w 0.7 GeV 65 (i.e. four times 
smaller splittings than with staggered fermions), and there are indications 
of a significant reduction if one uses dynamical quarks and improved gluon 
actions 66 or the non-perturbatively improved quark action 67 . Thus the 
situation is promising. 

3.8. Defining m c and the twist angle 

The critical mass plays a central role in tmLQCD, providing the origin 
about which one defines the twist angle (see the figure in sec. 3.7.1). As 
noted above, it must be determined non-perturbatively as part of the sim- 
ulation (and recalculated for each lattice spacing and choice of actions). 
The questions I address in this section are these: How accurately does one 
need to determine m c in order that automatic 0(a) improvement holds 
at maximal twist? What methods allow this accuracy to be achieved? I 
will discuss these questions using the framework of tmXPT, rather than 
than use the Symanzik EFT as in the original treatment 43 and subsequent 
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extensions 56,68,57 . TmXPT is less general (referring only to the vacuum 
and pion sectors), but more powerful in its domain of applicability (as it 
starts from the Symanzik EFT and includes further non-pcrturbative in- 
formation) . 

The form of the O(a) correction in the result for m 2 , eq. (76), is generic, 
namely that it is proportional to acoswo- To obtain automatic 0(a) im- 
provement one needs cosw = 0(a) and thus uu = n/2 + 0(a). In other 
words, maximal twist can mean "maximal up to 0(a)". What does this 
imply for the required accuracy in the determination of m c ? This depends 
on the relative size of quark masses and discretization effects. In the GSM 
regime, with /j, <~ gAq CD , an 0(a) accuracy in uo requires m' = 0(a 2 ) 
(and thus a determination of the dimcnsionlcss critical mass m c with an 
accuracy of (9(o 3 )). This is relatively straightforward to achieve, as we will 
see. Once one enters the Aoki regime, /U <~ a 2 Aq CD , the required accuracy 
increases to m! — 0(a 3 ). Since simulations are likely to need to enter this 
regime, it is important to know how to achieve this greater precision. 

The traditional definition of m c used with Wilson fermions is to extrap- 
olate m to the point where m 2 = 0. This method might be adequate in 
the GSM regime, but fails in the Aoki regime 56 . This failure has its ori- 
gin in the a 2 (w 1 ) term in eq. (76) and will be discussed in detail in the 
next section. Either the pion mass does not vanish but reaches a non-zero 
minimum at m c (at a first-order phase boundary), in which case extrapo- 
lating to ra\ = overshoots, or it vanishes over a range of m! of width a 2 
(the Aoki phase), with the correct choice of m c being in the middle of the 
range 56 but the extrapolation giving one of the end-points. In either case, 
m! ~ 0(a 2 ) at the putative critical point, which is not accurate enough for 
the Aoki regime. There are also practical issues with this method, reflecting 
the difficulty in doing accurate extrapolations, but I will not belabor them 
as this method is no longer being used in practice. 

To do better one can adapt the method used to determine the normaliza- 
tion of currents and improvement coefficients, i.e. enforce the symmetries 
that are broken by discretization. Here, parity and flavor are broken ex- 
plicitly, but are restored in the continuum limit. Enforcing this restoration 
in particular correlators for s ^ gives a non-perturbative determination 
of the twist angle, which can, if desired, be tuned to maximal twist. Since 
flavor and parity are broken, different choices of correlator lead to 0(a) 
differences in the twist angle, but all choices lead to automatic 0(a) im- 
provement. 

I will need to use the relation between twisted and physical bases dis- 
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cussed in sec. 3.7.1. Using this, a simple calculation finds the following 
relations between currents and densities in the two bases: 

A^ = cosu;4 + e 3bc sinu;F;, A% h<ll =A%, (83) 

V^ = cosu;^ + e 3bc sinu,^, V 3 h ^ = V 3 , (84) 

P 3 h = cos u P 3 + i sin lo S°/2 , P* h = P b , (85) 

S° h = cosluS + 2i sin ujP 3 , (86) 

where the flavor label b — 1,2. The flavor non-singlet scalar density vanishes 
at LO in SU(2) XPT and I do not discuss it. What these relations mean 
is that, if you are working in the twisted basis (as one usually docs on 
the lattice), then to construct the physical axial current with, say, 6=1, 
you must take a linear combination of the lattice currents A 1 and V£. 
These relations assume that the currents have been correctly normalized 
by multiplying by their corresponding Z-factors. Note that A^ and P b do 
not rotate, and so are good choices to create physical pions. 

The idea is now to take either (83,84) or (85,86) as a definition of uj, and 
enforce parity-flavor restoration in a particular correlator. Two examples 
are [method (ii) will be explained later]: 

Method (i) 69 method") (V^ >fl {x)P^(y)) cx (0\V p \J^) = . 

Method (iii) 55 ( Vp method") (S° h {x)A 3 ph ^(y)) K <0|Sj fc |7r 3 > = . 

The correlators are to be evaluated for x ^ y, and the long-distance con- 
tribution is as indicated. Using (83-86) one can manipulate these criteria 
into results for the twist angle in terms of correlators in the twisted basis: 

(Vfc)PHy)) + i(S°(x)Al(y)) 
t&nujA = , , , , , — , , , i , tanwp = — — „ / N - . (87) 
(Ai(x)pi(y)) 2(P*(x)A*(y)) 1 ' 

Maximal twist occurs when the denominators vanish, i.e 

uj A = n/2^(Al(x)P 1 (y))=0, u P = tt/2 => (Al(x)P 3 (y)) - . (88) 

While superficially similar, the two criteria differ because of flavor breaking. 
The correlator in method (iii) includes quark-disconnected contractions and 
is much more difficult to calculate in practice. I include it for illustrative 
reasons. Method (i) is used in practice. IX One fixes no and varies mo until 
= 7r/2. The resulting too(^o) depends on the choice of discretization 
of the axial current (e.g. 0(a) improved or not), and, in general, upon 
the separation x — y. At large distances, which are used in practice, the 



It has also been called the parity-violating method, or the PCAC method. 
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pion contribution dominates and the resulting mo becomes independent of 
separation. At such distances, method (i) is equivalent to the vanishing of 
the PC AC mass: 

mpcAC = (d^Al(x)P\y))/2(P\x)P\y)) = . (89) 

One nice feature of either criterion is that knowledge of Z-factors is not 
required, unlike the determination of u> at non-maximal twist. x 

Both methods (i) and (iii) can be implemented in tmXPT. With the 
technology developed above, we can work at NLO in the GSM regime, and 
calculate the correlators at long distances, for then the pion contribution 
dominates. I quote only the results at maximal twist: 55 

(i):cjA = 7r/2 => u = n/2 + 5 W , (iii): ujp = n/2 => lu = tt/2, (90) 

[with 5w defined in eq. (75)]. These results are shown in fig. 10. In method 
(i) one finds a line at an angle Sw ~ a to the vertical. Thus one is not at 
maximal twist in terms of ujq, but coswo ~ « so automatic improvement 
still holds. It turns out that method (iii) leads to a vertical approach to the 
Wilson axis. The methods come together at the critical mass. Thus one 
could implement method (iii) by using method (i) for /i ^ 0, extrapolating 
to /i = to determine m c , and then working at fixed toq = m c . 



method 
(iii) 


- 8 w/ 


/method 
/ (i) i 

method 
(iv) 

m' _ 




M/aZp) 


m^/(aZs) 

~a 


mgAa Zs) 



Figure 10. Results from different methods of defining maximal twist 



Quenched simulations at l/a ~ 2 GeV with unimproved Wilson 
fermions find an offset angle Sw ~ — 0.35 70 ' 71 . This is a direct measure 



X A generalization of method (i) allows the determination of lja for any twist angle 
without a priori knowledge of Z4 y 69 . 
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of discretization errors. Written as aA 2 this gives A w 0.7 GcV, a large 
but not unreasonable value. 

As noted above, the pion mass method can also be used, in principle, 
within the GSM regime. In practice the extrapolation can miss the critical 
mass, so it is useful to define a "straw-man" method (iv) in which m! is 
held fixed at a value of O(a). As is clear from the figure, loq varies from 
<~ 1 to as (i varies from ~ a to 0. Thus one is never near maximal twist 
in the GSM regime, and automatic improvement is lost. 

3.9. Results for m q ~ a 2 Aq CD (Aoki regime) 

As discussed above, to describe simulations we need to extend the analysis 
into the Aoki regime. This requires a change in the power counting: 64 ' 59 ' 57 
at LO we keep terms of size ml ~ \i ~ a 2 , 

^°Aoki = ^tr( J D At E^Et ) _^ tr(x 't E+p . c . ) _^[ tr( it s+p . c . ) ] 2 ; (91) 
while at NLO we keep those of size ma <~ aa ~ a 3 , 

tr(i f E) 3 +p.c. 

+Wtr(D^ D M S)tr(itS+ p.c.) - WTr(x' f E + p.c.)tr(i t E + p.c). (92) 

The two a 3 terms (those on the first line) are new, while previous NLO 
terms oc m 2 become of NNLO. In fact, the W$ t i term can be absorbed into 
x' by a further shift of 0(ct 4 ) in m c . This leaves one new LEC at NLO, 
Note that only the source parts of the W and W terms enter at NLO. 

In the following I will describe the analysis of the phase structure at 
LO, and then mention some modifications caused by NLO terms. 

The orientation of (E) is now determined by an equal competition be- 
tween the mass and a 2 terms in £^° Aoki - The former is minimized when 
(E) cx x' cx exp(iwoT3), while the latter either favors (E) = ±1 (W' > 0) 
or (E) = exp[i(7r/2)n • r] with h 2 — 1 (W < 0). The analysis along the 
Wilson axis is simple 41 , and one finds two cases. Either the condensate 
jumps discontinuously from (E) = +1 to (E) = — I at a first-order transi- 
tion {W 1 > 0), or it swings between these values continuously, as is possible 
within SU{2) (W < 0). In the latter case, the condensate breaks flavor, 
so there are exact lattice Goldstone bosons. This is the Aoki phase 61 . The 
presence of two possible phase structures was also predicted by Creutz 72 . 

Moving into the twisted-mass plane the analysis becomes more compli- 
cated as one must minimize a quartic 53 ' 58 ' 55 . I show below how the two 



- -^tr(iti)Tr ( itE + p.c.) - ^ 
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scenarios on the Wilson axis extend into the mass plane. 
P 
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Here I use mass variables rescaled by 
2B$m' m! 



2B Qf i 



(93) 



(l6\W'\a 2 /f 2 ) \w'\ ' ^ (16|W|a 2 // 2 ) K| 
Both scenarios have a first-order transition boundary, indicated by the 
solid line, with second-order end-points. Note again that the parameter 
w' , whose sign determines which scenario applies, and whose magnitude 
gives the size of the phase boundaries, is the same parameter as appears in 
the pion mass splitting 58 , eq. (81). Thus a calculation of this splitting in 
the GSM regime (which, as noted above, has been attempted) predicts the 
phase structure in the Aoki regime. 

Wu and I have given detailed plots of the condensate and pion masses 
along the dashed horizontal lines in the phase diagrams above 55 . I show 
here only a sample. I write the condensate as (E) = A m + iB m Ts, and 
plot the scalar component A rn , as well as the charged and neutral pion 
mass-squareds. Figure 11 shows results for W' < 0. The "swinging" of 
the condensate between ±1, described in words above, is here the (3 = 
curve. The charged pion masses vanish when \A m \ < 1, for then \B m \ > 
and flavor is spontaneously broken. Moving away from the Wilson axis 
(/3 = 1,2,3), the Aoki-phase is washed out by the explicit breaking of 
flavor. The neutral pion (not shown) is always heavier than the charged 
pions. 

If the mass is purely twisted (a — m' — 0) then one finds for all (3 
that A m = 0, so the condensate points in the T3 direction. Thus, despite 
the large discretization errors, the condensate ends up pointing in the same 
direction as it would without them. This is one way of understanding why 
automatic 0(a) improvement continues to work, as discussed further below. 

I illustrate the results for the first-order scenario (W 1 > 0) in fig. 12. 
The left panel shows the discontinuity in the condensate for |/3| < 2, trans- 
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Figure 11. Condensate and charged pion masses for W < 0, for four choices of twisted 
mass (/3 = 0, 1, 2, 3). The thickness of the lines decreases with increasing /3. 



forming into a smooth crossover for \(3\ > 2. Above the transition, note 
again that A m — when a = 0. The right panel shows that the pion 
mass on the Wilson axis (/3 = 0) has a non-zero minimum, while for /3 = 2 
the neutral mass goes down to zero at the second-order end-point. In this 
scenario, the charged pions are always heavier than the neutral once one 
moves off the Wilson axis. Unlike the Aoki-phase scenario, however, no pi- 
ons are massless along the phase boundary away from the end-points. This 
is because no lattice symmetry is broken along this transition. 



Figure 12. Condensate and pion masses for W' > 0. Notation as in fig. 11, except 
that pion masses are shown only for /3 = 0, 2. For (3 = charged and neutral pions are 
degenerate. For fi = 2, the neutral mass is shown dashed. 




A 
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3.9.1. Applications to lattice simulations 

(I) The most important lesson we learn from tmXPT is to expect non-trivial 
phase structure when m q ~ a 2 , with two possible scenarios. The predic- 
tion of an Aoki-phase was actually made long before the tmXPT analysis 61 , 
in order to understand how m 2 could vanish without an underlying chiral 
symmetry, and quenched studies in the 1980's and 90's found evidence for 
such a phase. What has happened in the last few years is the beginning 
of detailed dynamical studies of the twisted-mass plane. Those with unim- 
proved Wilson gauge and fermion actions found evidence for the first-order 
scenario. I think it is fair to say this was a surprise, and that it was pre- 
dicted by XPT I view as a significant success. It is amusing that for decades 
we have been assuming that m 2 extrapolates all the way to zero, whereas 
(at least with the simplest actions) it actually never makes it. 

I show an example of the evidence for the first-order scenario in Fig. 13. 
This shows scans at fixed twisted mass (with /i roughly fixed in physical 
units) for three lattice spacings (a decreasing as (3 increases) . The plaquette 
and ttipcac both have a discontinuity, and show hysteresis. Both effects 
decrease as one approaches the continuum limit, qualitatively consistent 
with expectations. The fact that ropcAC bas a minimum away from zero 
is a manifestation of the non-zero minimum in the pion masses. 
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Figure 13. Average plaquette and mpcAC plotted vs. (2k) 1 = mo + 4 for fixed fig 7 



(II) The predictions for physical quantities that I sketched above in the 
GSM regime have been extended to NLO in the Aoki regime 55 ' 64 ' 59 ' 57 . 
This is not trivial to implement, since minimization of the potential (with 
the W"3,3 term included) now involves a sextic equation. Detailed fits to 
quenched data have, however, been done, and find reasonable values for 
the resulting LECs 64 . 

I cannot resist showing one NLO prediction. Discretization effects dis- 
tort the contours of constant m 2 + from circles into the forms shown in 
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fig. 14. Here m" is just the shifted mass m' with the 0(a 3 ) term propor- 
tional to W$ t i absorbed as well. These plots are illustrative of the size 
of the expected effects, not the result of details fits. I have taken stan- 
dard LO continuum LECs, and set the NLO continuum LECs and chiral 
logs to zero for simplicity. For the lattice LECs I use Sw = = —0.3, 
\w'\ = (250 MeV) 2 and Ws t s — 0, values which are not unreasonable for 
a -1 w 2 GcV. Quark masses range up to <~ m s /2. Clearly the impact of 
discretization errors and phase structure could be very significant for actual 
simulation parameters. 



Contours of mpi~2 Contours of mpi A 2 




-0.04 -0.02 0.02 0.04 -0.04 -0.02 0.02 0.04 

m' ' m' ' 



Figure 14. Contour plots of re? ± in twisted mass plane, for Aoki-phase (left) and first- 
order (right) scenarios, using a representative parameter set for 1/a ss 2 GeV. Quark 
masses arc in GeV. Raggcdncss in contours is due to numerical errors. 



(Ill) Another important question that can be studied using tmXPT in 
the Aoki regime is whether automatic 0(a) improvement at maximal 
twist still holds using the criteria introduced in sec. 3.8. The answer is 
positive 56 ' 55,59 ' 57 , aside from a caveat I will explain. In fig. 15 I show the 
impact of NLO corrections on the phase boundaries and the lines of max- 
imal twist 59 . These plots allow one to understand the qualitative features 
of the contours in fig. 14. 

The lines of maximal twist shown in the figure are those for methods 
(i) and (iii). Along either line the physical quark mass is proportional to 
\i. If one simulates on one of these lines, reducing a while holding /i fixed 
(i.e. working at fixed physical quark mass), then the dependence of physical 
quantities on a will be quadratic or of higher order. This is automatic 0(a) 
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Figure 15. Phase diagram at NLO for Aoki-phasc (left) and first-order (right) scenarios. 
Method (i) corresponds to u>a = t/2, method (iii) to uip = n/2. 

improvement. In the Aoki-phase scenario it holds even for fi <C a 2 , i.e. all 
the way to the Wilson axis. For the first-order scenario the result breaks 
down, however, when one runs into the end-point with \l w \w'\ <~ a 2 . This 
is the caveat mentioned above. 

It is perhaps surprising that one can work in a regime where the quark 
mass is much smaller than the leading 0{a) effects of discretization, and 
yet be able to tune parameters so these effects do not enter into physical 
quantities. It is worth understanding this qualitatively. The point is that 
the tuning has to be very fine. If \l <~ a 2 , then, as fig. 15 shows, one must 
tune m" to an accuracy of a 3 . This is what the criteria in eq. (88) accom- 
plish. In terms of the Symanzik Lagrangian, this amounts to canceling the 
untwisted mass (so that it is 0(a 3 ) or smaller). The result is 

L {4+5) = + M^75T3V + ac^io^F^iP + 0(a 2 ) (94) 

= V'physPV'phys + ^V'physV'phys + aC^^^T^O ^phys , (95) 

where in the second line I have rotated to the physical basis. The absence of 
a iptp term on the first line means that the O(a) term is purely flavor-parity 
breaking in the physical basis, and does not contribute to physical matrix 
elements however large it is. XI This is nothing more than a summary of the 



A similar argument can be made directly at the chiral Lagrangian level, and relies on 
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original argument for automatic improvement 43 (although simplified, as I 
am not including operators). The only new point here is that the argument 
goes through for arbitrarily small /i as long as to" is tuned accurately 
enough. 

It is now time to unveil what I call "method (ii)" , which works as follows. 
One first defines a critical mass by extrapolating results from method (i) 
to the Wilson axis, and then keeps too fixed at this critical mass for all 
/i. It turns out to correspond to working at to" = 0. In the GSM regime 
(recall fig. 10) method (ii) is equivalent to method (iii), but in the Aoki 
regime the line it defines is offset from that of method (iii) by 0(a 3 ). Such 
a difference might seem too small to be significant, but it need not be 57 . 
To see this, consider the Aoki-phase scenario. Then from fig. 15 one sees 
that method (ii) interpolates between method (i) {fi < a 2 ) and method (iii) 
(yu » a 2 ). Now, it turns out that the condensate has a fixed orientation as 
one moves along the lines of either method, but the orientations for the two 
methods differ by an angle of 0(a). Thus for method (ii) the condensate 
varies direction rapidly for \i < a 2 , with a gradient of <~ a /a 2 . This can 
disrupt extrapolations as the resulting form is not a simple polynomial 57 . 
I raise this worry because method (ii) is being used in practice by some 
groups, and it would be preferable to use method (i). 

(IV) A further lesson from tmXPT is that both the size of the phase bound- 
aries and the isospin splitting for pions are determined by the same param- 
eter, w' . It thus makes sense to try and tune the gauge and fermion actions 
to reduce \w'\. Note that this tuning is not the same as a systematic im- 
provement program, but it is nevertheless very important. Initial results in 
this direction are very encouraging. For example, fig. 16 shows that the dis- 
continuities in the plaquette are significantly reduced by alternative gauge 
actions. xn It is also found that using an improved fermion action reduces 
the pion isospin splitting and thus w' 67 . This is an active area of present 
research. 

(V) It might appear from fig. 15 that, if we had to choose, we would prefer 
the Aoki-phase scenario because we can work at maximal twist down to 
/i = 0. I think, however, that the choice is not so clear. To illustrate why, 
I compare the results for pion masses in the two scenarios in fig. 17, using 



the result, noted above, that the condensate points near the T3 direction at maximal 
twist 53 > 55 . 

XII In fact, the reduction is greater than appears, as the results with the Wilson action 
are for /i ^ 0, where the transition is weaker than for fi = 0, the value used for the other 
actions. 
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Figure 16. Hysteresis of plaquette at a xs 0.17 fm for DBW2 (top), tree-level improved 
Symanzik (middle) and Wilson (bottom) gauge actions. 74 . 

the same "reasonable" choices of LECs as above. The role of the charged 
and neutral pions is interchanged in the two scenarios, and there is no clear 
advantage to using one over the other. The figure also emphasizes the 
significance of the 0(a 2 ) effects. I conclude that it is much more important 
to reduce the magnitude of w' than it is to end up in one or other scenario. 




Figure 17. m ^± (solid) and m 2 (dashed) in GeV 2 versus ji in GeV for method (i). 
Left panel: Aoki-phase scenario; right panel: first-order scenario. 



3.9.2. Bending near maximal twist 

The discussion to this point makes clear the importance of accurate tuning 
to maximal twist. Initial studies of tmLQCD did not always attain this 
accuracy and observed a phenomenon called "bending" . Although this is 
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largely a closed chapter in the history of tmLQCD, it is worth learning 
the appropriate lessons, particularly as tmXPT played an important role in 
elucidating the phenomenon. 



The sketch to the right shows 
the Aoki-phase scenario with three 
choices of the approach to the chiral 
limit shown by arrows. They are all 
nominally at maximal twist. These 
are method (i) [thick lines]; fixing 
m c at an endpoint where = 
[medium thickness] , and missing the 
endpoint by ~ a 2 [thin lines]. 




m" 



I show in fig. 18 the NLO tmXPT results (using the same parameter 
set and assumptions as above) for m 2 ±/[i for the three methods (with 
the thickness of the lines matching those above), along with results from 
simulations. The horizontal axis is /i, which is the quark mass when working 
at maximal twist. For my parameters (i.e. physical LECs and chiral logs 
dropped) the correct result is a constant, and is illustrated by the 0(a) 
improved result from method (i) [thick line] . The results from the other two 
choices show a noticeable bending away from the continuum result at small 
masses, i.e. a breakdown of 0(a) improvement. The numerical results are 
for a similar range of quark masses, but the axes are in different units and so 
the two plots can only be compared qualitatively. The numerical results arc 
for method (i) ("PC AC definition" — triangles) and the "m^ = definition" 
(circles), and are qualitatively consistent with tmXPT. A detailed fit of 
these and other quenched data to quenched tmXPT has been done, and 
finds quantitative agreement 64 . 

The most important conclusion from this figure is that one should use 
a good definition of maximal twist, such as method (i). In particular, 
the failure of the m n — method is apparent. This can be understood 
qualitatively as follows. The condensate at maximal twist should lie in the 
T3 direction, (£) = ir^ + 0(a). This is the case with method (i). In the 
other cases, however, the condensate starts in this direction for /i»a 2 but 
rotates away when /i ~ a 2 and ends up at (£) = — 1 when /i = 0. Thus 
automatic 0(a) improvement is lost. xnl 



XIII If one treats deviations from (£) = ir3 as small perturbations, then discretization 
errors have tt° poles and are infrared enhanced 68 . This appears superficially like a break- 
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Figure 18. "Bending" in rn^/fj, vs. /i from tmXPT 59 (left — both axes in GeV) and 
quenched simulations 44 (right — both axes in lattice units at l/a fa 2 GeV using bare fj,). 

3.9.3. Does tmLQCD work in practice? 

TmXPT has been a very useful guide to the exploratory numerical studies 
of tmLQCD, but the most important question is whether working at max- 
imal twist is practical. In particular, is the requisite fine tuning possible, 
and cheap enough, in practice? Are results for physical quantities really im- 
proved? These questions are being actively studied and the answers appear 
to be positive 44 ' 74,75 . 

A different question is how best to include the strange quark, as one 
needs an even number of quarks to implement maximal twisting. One 
could just add a single non-perturbatively 0(a) improved untwisted strange 
quark to a maximally twisted up-down pair, but this would require non- 
perturbative improvement of each new operator involving the strange 
quark, which is exactly what we are trying to avoid with automatic im- 
provement at maximal twist. An alternative is to introduce the charm 
quark as well, and treat the strange-charm pair as a twisted doublet 44 . Non- 
degenerate masses can be incorporated while maintaining maximal twist 76 . 
The disadvantages are that flavor breaking is more complicated (occurring 
in both T3 and T\ directions in the strange-charm sector), and that (am c ) 2 
effects may not be small. This option is being actively studied 77 . 

Finally, I think it likely that isospin breaking in pion masses will need 
to be included in loop calculations, requiring one to work at NNLO in the 
GSM power counting. First steps in this direction have been taken 78 . 

For further possible uses of tmLQCD, see the lectures by Sint. 



down of the Symanzik expansion, but, in fact, the divergences are summed automatically 
in tmXPT if one expands about the correct vacuum 59 . 
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4. Partial quenching and PQXPT 

My final lecture is devoted to introducing partially quenched QCD and the 
corresponding EFT, PQXPT. I will consider PQQCD only in the contin- 
uum, and focus on conceptual issues. I will not discuss the extension to 
include discretization errors — this is straightforward following the methods 
of the previous lecture — nor the related topic of mixed action XPT. Nor will 
I discuss quenched QCD and quenched XPT (QXPT), except in passing, as 
it is not useful for obtaining quantitative physical information about QCD. 



4.1. What is PQQCD and why is it useful? 

I have given an overview of the meaning of partial quenching in the intro- 
duction. Here I give a concrete example for the pion correlator in QCD. If 
we write out how this is calculated in detail, 

C w (r) = -/^u 75 d(f ) r) d<y 5 u(0)\ (96) 

= -\\ DU]jDqDqe- S ^- f *^^ {p+m « )q Y,ui5d(x,T) d-y 5 u(0) 
J q ■ 

= ~Z l £ ' C/ II dCt ^ + TO 9) e ~ SEaUEC X] tr 



75 



1 



cx /^e - " 1 " 1 " + exponentially suppressed. 



1 




we see that, in practice, we are free to use different masses in the deter- 
minant ("sea" or "dynamical" quark masses) and in the propagators ("va- 
lence" quark masses). Doing so is called partial quenching. It is relatively 
cheap to do so, since the calculation of propagators is usually a small over- 
head on the generation of configurations, thanks to the determinant, and 
because the rate of increase of CPU time as m q decreases is less severe for 
propagators. The big question is whether we can make use of the extra 
information provided by working with m va i ^ 

At this point it is appropriate to interject a comment on nomencla- 
ture. In the bad old days we used the quenched approximation, in which 
m sca — > oo, so the quark determinant could be ignored. This allowed sim- 
ulations to be done, but at the heavy cost of ignoring quark loops and 
thus considering a theory which is not unitary and had various sicknesses. 
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QQCD is at best a model of QCD — there is no quantitative relation be- 
tween the two theories. Partial quenching is in one sense a less extreme 
version of quenching, and thus the name. If m sca S> Aqcd then PQQCD, 
like QQCD, is only qualitatively related to QCD, and the name is appro- 
priate. What I consider here, however, is PQQCD with sea quarks light 
enough, m sea <C Aqcd, that (as I will argue) we can use XPT to relate the 
results quantitatively to physical QCD. For such theories a more appropriate 
name, with less negative connotations, would be "partially unquenched". 
But I will not try and create a naming revolution and will stick with the 
canonical nomenclature. 

One thing that having light sea quarks does not change is that PQQCD 
is unphysical. To see this consider a contribution to ir°ir° scattering in 
QCD: the two pions first annihilate into glue, from which is created, say, 
a 7r + 7r~ intermediate state, which subsequently annihilates back into glue, 
and then finally a new 7r° pair is created. In the corresponding process 
in PQQCD, the intermediate tt + it~ pair must be composed of sea quarks, 
and thus have different masses from the valence 7r° pairs. Having different 
intermediate and external states means that the theory is not unitary. An- 
other unphysical feature is the appearance of double poles in propagators. 
This is not possible if one can insert a complete set of physical states. The 
fact that there are double poles can be seen in PQXPT 79 , but can also be 
seen more generally from the properties of the quark-level theory 80 . 

Because PQQCD is unphysical, it is essential to have a quantitative 
method relating its properties to those of QCD. This is provided by PQXPT. 
Thus we must simulate in the regime of quark masses where XPT is valid if 
PQQCD is to be a useful tool. This is illustrated above in fig. 1. Note that 
the presence of the physical subspace with m va i = m sca within PQQCD is 
crucial in order to pin down the relation to QCD. 

4.2. A field theoretic formulation of PQQCD 

As we have seen, the construction of an EFT makes essential use of sym- 
metries. Thus to develop PQXPT we need a formulation of PQQCD which 
makes its symmetries manifest. One way to do this is to use Morel's trick 81 
of introducing commuting spin- 1/2 fields (ghost quarks labeled q), whose 
determinant can cancel that from the valence quarks: 




(97) 
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The formulation then includes three types of quark: valence quarks qyi, 
qv2, ■ --qvNv, {N v = 2,3, . . .), sea quarks q S i, qs2, ■ --qsN (N = 2,3), and 
ghost quarks qvi, qv2, ■ ■ ■ qvN v {Nv = 2, 3, . . . ). The ghosts are degenerate 
with corresponding valence quarks. It is notationally convenient to package 
these fields into (N + 2iVy)-dim vectors: 

Q tr = qvi, ■ ■ ■ , qvN v , qsi, • ■ • , qsN, qvi, ■ ■ ■ , qvN v , (98) 

V . ' V v ' „ '/ 

valence sea ghost 

Q = ( qvi? ■ ■ ■ >qvN V > Qsit ■ ■ ■ >~q~sN> Qvv ■ ■ ■ ^vn v ) > (99) 

V v ' " v ' V „ '/ 

valence sca ghost 

and similarly for the masses 

M = ( mvi, ■ ■ -,m VNv , m S i, ■ ■ -,m S N, m vl , . . .,m VNv ) . (100) 
V y ' v * ' v '/ 

valence sea ghost— valence 

The action of PQQCD is a simple extension of that of QCD 

•SpQ = -Sgaugc + Jq(P + m)q (ioi) 

N v N 

Q(p + M)Q = J2 QviiP + m vi )q vi + ^ q Sj (p + m Sj )q Sj 

i=l j=l 
N v 

+ Y,qvk(P + m vk)qvk- (102) 
fc=i 

If we similarly define an extended measure as 

N v N 

DQDQ = Y[ (Dq Vl Dq Vl Dq vi Dqv) J] {Dq Sj Dq Sj ) , (103) 
i=i j=i 

then we can write down a partition function containing valence quarks that 
nevertheless reproduces that of QCD: 

Z PQ = J DUDQDQ e~ 5p « (104) 

^/^--iSn^t^ <«») 

N 

(106) 



= f DUe~ s '*°- f[ det(p + m Sj ) = Z QC n 

J .5=1 
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So far this is rather trivial. The power of the method is that it provides 
field-theoretic expressions for PQ correlation functions, e.g. 

CP Q (r) = -Zpi J DUDQDQ Y,u vlb d v {x, T ) d vl5 u v (0)(101) 

X 

N 

= Z P ^ DUj[ det(D + m Sj )e- s ^° 



75 -fT- 75 



P + m vd J xQ \P + m Vu/Qx 



(108) 



This is exactly the pion correlator with which we started, eq. (96), but with 
differing valence and sea quark masses. XIV 

As noted above, PQQCD is "anchored" to QCD (or, more precisely, 
to physical, QCD- like, theories). Let us see how this works in our new 
formulation. Consider the PQ pion correlator, but now set the valence 
quark masses equal to two of the sea quark masses, i.e. my u — msj and 
mvd — tnsk- Then the PQ correlator is equal to a physical QCD correlator: 

Cl Q {r) = Z p ^ f DUDQDQ e' 3 ^ ^u vl5 d v {x,T) d v -f 5 u v (0) (109) 
= Zp^ J DUDQDQe- S ^^q Sjl5 q sk {x,T)q skl5 qsM( 1 ^) 

X 

N 

= ^qcd / DUY[Dq Si D qsi e- s ^ 

x J2^sjl5qsk(x, t) q Sk "/5qsj{0) (in) 

X 

= CQ CD (r). (112) 

To obtain the second line I used the result that the propagators obtained by 
doing the Wick contractions are the identical to those from the first line. 
This is an example of the enlarged symmetry of the PQ theory. Having 
removed the valence quarks from the operators, the valence and ghost inte- 
grals cancel, leaving a QCD correlation function. This analysis generalizes 
to any correlation function containing N or less different valence quark- 
antiquark pairs (recall that we can add any number of valence quarks) . If 
there are more than N such pairs, e.g. a two point function containing 



This formulation works as well for quenched QCD — one just omits the sea quarks 
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QvilV2<ivz < iVA and its conjugate in the presence of 3 light sea quarks, then 
there is no corresponding QCD correlator even if valence masses are all 
equal to sea masses. This is an example, developed further below, of how 
PQQCD gives one access to combinations of Wick contractions that do not 
occur in QCD itself. 

The field theoretic formulation shows that PQQCD is a well-defined 
statistical system. In particular, the ghosts do not present a theoretical 
problem in the Euclidean functional integral. As long as quark masses are 
positive, the functional integrals over the ghost quarks converge (since }Z> 
has imaginary eigenvalues). Of course, the theory remains unphysical, and 
indeed we can now see this more directly. If we rotate to Minkowski space 
we will violate the spin-statistics theorem, and thus have an unphysical 
theory. Put another way, PQQCD docs not satisfy reflection positivity (as 
can be seen, for example, from the fact that the ghost pion correlator has 
the opposite sign to that for the normal pion), and so one cannot construct 
a physical Hilbert space with a positive Hamiltonian. But the unphysical 
nature need not be a problem if we use PQQCD in Euclidean space as a 
tool to gain information about QCD. 

4.3. Developing PQXPT 

Before diving into the theoretical details let me give some qualitative moti- 
vation for what follows. We will need to assume that PQQCD is described 
by an effective theory close to that for QCD. I think it is fair to say that our 
confidence in this assumption is based in part on results from simulations. 
In particular, the charged pion correlator, C^ q (t), has essentially the same 
form at long distances in PQQCD (and also QQCD) as in QCD: it falls 
as exp(— m^r) at long times, and m 2 oc (my M + mvd) to good approxima- 
tion. There is no sign of unphysical effects, e.g. double poles [which fall as 
t exp(— m w i)], or negative residues. It is only when one looks in detail that 
one finds deviations from QCD expectations, such as the enhanced chiral 
logarithms in QQCD. There are also other correlators where partial quench- 
ing has a dramatic effect: double poles do appear in the rf correlator, and 
there are negative contributions to the scalar-isovector correlator. Never- 
theless, the apparent closeness of the infrared physics of PQQCD to that 
of QCD provides important motivation for the development of (P)QXPT. 

This development has been done using two methods. The first is the 
"graded-symmetry" method 79 based on Morel's trick and which I use here. 
This is an extension of earlier work on the quenched theory 82 . The main 
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issue, as we will see, is the whether the usual "derivation" of an EFT 
goes through when the theory is unphysical. The second method uses the 
"replica-trick" 83 , in which one removes the valence determinant by sending 
Ny — * rather than using ghosts. xv This has the advantage that the 
theories with integer Ny, from which one is extrapolating, are physical, 
and so one expects their long-distance physics to be described by XPT. 
Its disadvantage is that extrapolating in Ny is, in general, theoretically 
uncontrolled. 

For the purposes of the present lecture I could use either method, since 
they are known to give the same results at one-loop in PQXPT, and it 
is plausible that this holds to all orders 83 . I choose the graded-symmetry 
method as I am more familiar with it. XVI 

4.3.1. Symmetries of PQQCD 

In the notation developed above, Spqqcd = ^gaugo + Q(P + ■M)Q looks 
just like Sqcd, and appears to have a graded extension of chiral symmetry 
when M — > 0, involving rotation of quarks into ghosts and vica-versa: 

Ql.r — U L . R Q L , R , Ql r > Ql r^l r > u l,r e SU(N V + N\N V ). 

(113) 

The apparent symmetry is SU(N V + N\N V ) L x SU(N V +N\N V ) R x U(l) v . 
In fact, there are subtleties in the ghost sector: the transformations are 
inconsistent with the requirement that Q contain q v in the ghost sector 
(and thus is related to Q, unlike in the quark sectors). This is necessary 
for convergence of the ghost functional integral 86 . I do not have space 
to discuss this technical detail, and I refer the interested reader to the 
literature 86 ' 87 ' 80 ' 88 . The bottom line is that, for perturbative calculations 
in the EFT, one gets the same answer using the apparent symmetry group. 

4.3.2. Brief primer on graded Lie groups 

Since these are lectures, I recall a few basic properties of graded Lie groups. 
Graded means that the group matrices, U, contain both commuting and 

xv This method gives a formalization of the "quark-line" method which I used to develop 
quenched XPT 84 . 

XVI The equivalence is less well established in contexts where EFT calculations are non- 
perturbative, such as in the e— regime. Here calculations are harder in both approaches, 
but agree where they overlap 85 . 
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anticommuting elements 
/ A B \ 

U = C D A, D commuting; B, C anticommuting . (114) 
\N V +N n v ) 

U is unitary [i.e. U € U(N V + N\N V )} if UW = WU = 1, as for normal 
matrices, as long as one complex conjugates anticommuting variables as 
(Viifc)* = vZv*- Trace is generalized to "supertrace" , defined to maintain 
cyclicity: 

str U = trA - trD str(f/if/ 2 ) = str([/ 2 ?7 1 ) . (115) 

Determinants generalize to "superdeterminants" , 

sdet U = exp[str(ln[/)] = det(i4 - BD^Q/detiD) , (116) 

which satisfy sdet(U\U2) = sdet (Ui) sdet (/7 2 )- Using this one can define [/ e 
SU(N V + N\N V ) as unitary graded matrices with unit superdeterminant. 

To get a feel for the subtleties of the graded groups it is useful to consider 
examples of SU(Ny + N\N) matrices: 

/ e i9N v q \ ( e i6N v \N+N v 

f>/ = o e *("+"v) J sdct ^ = (e i8(iv+Arv))Arv = 1 - (118) 

[// looks just like an SU{2Ny + N) matrix, while Uu does not (having a 
determinant differing from unity) . Its superdeterminant is unity thanks to 
the det(D) in the denominator of (116). 

One feature of U (Ny+N\Nv) which is unchanged from ungraded groups 
is that one can pull out a commuting U{1) factor, U(Nv + N\Ny) = 
[SU(N V + N\N V ) <g> U(1)]/Z N , with the [7(1) being a phase rotation: xvn 

/pie o \ ie(N+N v ) 
Um = ( „ e „ J =► sdet [7 77/ = ^ = e ieN . (119) 



4.3.3. Chiral symmetry breaking 

We now follow the same steps as we did for QCD in sec. 2.2, noting differ- 
ences along the way. We expand PQQCD about M. — 0, where the chiral 



XVII Note that sdet(C//j) = 1 in the quenched theory (TV = 0), so that Ujjj lies in 
SU(Ny\Ny), indicating that the quenched group structure is more complicated. 
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symmetry group G = SU(N V + N\N V ) L x SU(N V + N\N V ) R is exact xvm 
We know that this symmetry is broken, because it is broken in the massless 
QCD contained in massless PQQCD. To study the symmetry breaking we 
introduce a graded generalization of the order parameter (6) used for QCD: 

fi« = (QL,i,a,c3fl, j , Q ,c>PQ ^U L nU R . (120) 

Next we assume that the graded vector symmetry, SU {Ny +N\Ny)y , is not 
spontaneously broken. If M is diagonal, real and positive, this follows from 
an extension 80 of the Vafa-Witten theorem for QCD 23 . The assumption is 
thus that nothing singular happens as M — > 0. Given this, we know that 
n=wxl, and, furthermore, we know from the QCD sub-theory that u = 
(QsQs) 0- Thus the symmetry breaking is G -> H = SU(N V + N\N V ) V , 
a simple graded generalization of that in QCD. 

We can now derive Goldstone's theorem using Ward identities for two- 
point Euclidean correlators, which remain exact in the PQ theory 80 . The 
result is that there are massless, spinless poles which couple to Q"f^j5T a Q 
for all (N + 2N V ) 2 - 1 traceless generators, T a , of SU(N V + N\N V ). Of 
these, 2{N + Ny)Ny are fermionic (quark-ghost particles), (N + Ny) 2 — 1 
are bosonic with normal sign two-point functions (quark-quark), and N v 
are bosonic with unphysical sign two-point functions (ghost-ghost). 

4.3.4. Constructing the EFT 

It would appear that we have the ingredients to construct an EFT, just as in 
QCD: we know the symmetries, and we have a separation of scales. In fact, 
we know much less than in QCD, because PQQCD is not physical. In QCD, 
the poles in two-point functions correspond to particle states in a physical 
Hilbcrt space, and thus we know that correlation functions of arbitrary 
order will also have these poles, and from their residues we can extract the 
S-matrix, which will be unitary. Furthermore, since QCD involves local 
interactions, the S-matrix will satisfy cluster decomposition. Then we can 
invoke Weinberg's "theorem" and write an effective, local field theory for 
the light particles. In PQQCD, by contrast, we do not know that the 



xvm^ posteriori we will find that we must take the chiral limit with my and mg in 
fixed ratio, because there are divergences if my — > at fixed mg 89 . This non-analyticity 
is not a barrier to the construction of the EFT. Non-analyticitics are present in physical 
quantities in continuum XPT as well (e.g. the chiral logs), but arise from infrared physics, 
just as the divergences here. There is no reason to think that the coefficients in the EFT, 
which result from integrating out ultraviolet physics, are non-analytic in my,m$- 
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poles we have found in two-point functions also appear in higher order 
correlators, and we do know that there is neither a physical Hilbert space 
nor S-matrix. Indeed, it can be shown that neutral correlators have double- 
poles if my 7^ m/°. So we cannot rely on Weinberg's argument. Instead 
we must simply assume that there is a local EFT containing the Goldstone 
modes and satisfying the symmetries. In other words we assume a minimal 
change from the EFT for QCD. 

This is not as ad hoc as it might sound. Let me give three arguments 
in favor of this assumption. First, we know that there is a local EFT for 
the QCD sub-theory, and that this describes the long distance behavior 
of correlators. What PQQCD allows one to do is to separate individual 
Wick contractions contributing to QCD correlators. It seems implausible 
that, for example, the description of these individual contractions would 
require a non-local interaction (leading to a different pole structure) whose 
effects cancel when one adds them to form QCD correlators. Second, one 
can derive chiral Ward identities in PQQCD for arbitrary order correlation 
functions, that are generalizations of those in QCD. PQXPT satisfies these 
identities by saturating them with Goldstone pole contributions. In this re- 
gard, there is numerical evidence from simulations that in, say, four-point 
functions the Goldstone poles dominate when one pulls one of the opera- 
tors far from the others, as predicted by PQXPT. Finally, one can imagine 
carrying out a Wilsonian rcnormalization group program in a Euclidean 
theory, in which one successively integrates out "shells" of high-momentum 
modes. This automatically leads to a local interaction, and symmetries are 
preserved. One can think of Symanzik's EFT for lattice QCD as an exam- 
ple. We cannot actually do the calculation here, given the non-perturbative 
physics of QCD, but if we could, and if the two-point functions correctly 
tell us the appropriate low-energy degrees of freedom, it is plausible that 
we would end up with PQXPT. 

Having assumed the nature of the EFT we continue following the same 
steps as for QCD. We "promote" the condensate into a field, 



and, assuming standard masses so that (£) = 1, we define NG particles by 



Sl/w ^> H(x) e SU(N V + N\N) , T,-^U L T,U J 



n ■ 



(121) 





(122) 



Here </> are the quark-quark "normal" NG bosons, </> are the ghost-ghost NG 
bosons, and 77^2 are quark-ghost NG fermions. The constraint sdet £ = 1 
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implies str $ = tr <f> — tr <f> = 0. The QCD part of E is 

( \ 

w(x) 



*(a:) 




£ = E QCD . (123) 



V N v N N v J 

Next we construct the most general local, Euclidean- invariant, G- 
invariant Lagrangian built out of E, D^E and X . Here the covariant deriva- 
tive and x are graded generalizations of the corresponding terms in XPT. 
In particular \ — 2B (s + ip) — > UlxUr, with the sources set to s = M 
and p = at the end. We can use generalizations of the same building 
blocks as in XPT, e.g. 

L M - SD^Et ^L^C/t ; str(i ^ = o ; (124 ) 

The power counting (which is independent of the nature of the fields) is the 
same as in XPT. 

In this way we arrive at the PQ chiral Lagrangian through NLO: 

4q = x str ( D ^ D ^) - x str(xSt + Ext) (125) 

4q = ~Li [str(D M ED M Et)] 2 - L 2 str(D^D^)str(D^D^) 
+L 3 str(D^D^D^D,^) 

+L 4 str( J D Al E t J D Al E)str( X t E + E t x) + L 5 str^E^E^E + E^]) 
-L 6 [str( X tE + Et x )] 2 -L 7 [str^E-E^)] 2 -L 8 str( X t E X t S+? ,. c .) 
+iL 9 str(L My L) M EL) ly E t +p.c.) + Li str(L Ai „Ei? M ^E t ) 

str (L^L^ +p.c.) - H 2 str( X f X ) + £ W zw,pq 
+L PQ PQ . (126) 

These are almost carbon copies of the corresponding results in X PT 
[cqs. (28,31)], except that tr — ► str, and there is an additional term in £p 4 Q 
(the Opq term) 90 . To my knowledge, no-one has worked out the struc- 
ture of the PQ WZW term in detail, though its contributions to 7r° — > 77 
vertices have been analyzed 91 . 

Thus we find that the number of LECs in PQ X PT is the same as in 
X PT at LO, and that there is only one more, Lpq, at NLO. But how are 
these LECs related to those of XPT? The answer is simple 6 : they are the 
same! This can be seen by considering correlation functions created by 
sources s,p, ^,r M restricted to the QCD sub-space. At the quark level, 
these are QCD correlators, since valence and ghost contributions cancel 
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identically. Thus they are described by XPT. At the EFT level, one can 
show diagramatically in PQXPT that a similar cancellation occurs, and 
that one can do the calculation using E restricted to the QCD subspace as 
in (123). Inserting this form into £pq 4 ' one finds (i = 1, 10, HEC ignored): 

C^\s,Li,L PQ ) =£( 2 < 4 )(£ QCD ,L0- (127) 

In words, the calculation one does with QCD sources in PQXPT is exactly 
that one would do in XPT. For the results to be equal it must be that 
the Li are equal. This is the key result in PQxPT, for it means that the 
predictions of PQXPT involve only slightly more LECs than those of XPT. 

4.3.5. What about C PQ ? 

Back when we were constructing in XPT, I noted that one possible 
four-derivative term was not independent for N < 3. This is due to Cayley- 
Hamilton relations between traces of finite matrices. Such relations do not 
hold for graded matrices, and so the term is independent in PQXPT. It is 
convenient to write it as 

Opq = str(D„ED v EtD M ED„Et) + 2 B tr(D ll ED v tfD ll ED v tf) 

- str(£>„££>„Et) 2 /2 - stT(D^D^)stv{D^D^) , (128) 

for then it vanishes if E is restricted to its QCD subspace as in (123). This is 
why Lpq does not appear on the right-hand side of eq. (127). Opq does not 
vanish for general E, however, and thus appears in CpQ with a new LEC. 
This additional LEC also appears in standard XPT if N > 4, when the 
Cayley-Hamilton relations become less restrictive. As one goes to higher 
order, the number of additional such operators in PQXPT increases 90 . 

How does this new operator enter into results for measurable quantities 
in PQQCD? It can only contribute to PQ quantities, for the considerations 
above show that it vanishes when restricted to the QCD sub-space. It turns 
out to contribute to PQ inr scattering at NLO, but to PQ and / w only 
at NNLO 90 . Thus its practical impact is small. 

It is worthwhile, however, understanding its origin more deeply. As I 
have repeatedly mentioned, PQQCD allows one to separate individual Wick 
contractions, unlike QCD. For example, ir + K° scattering in QCD has two 
contractions (thin [blue] is u, medium [red] is d, and thick [brown] is s): 
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0*0 



and Opq makes no contribution to this process. We can separate these con- 
tractions in PQQCD using, for example, scattering involving ghost quarks: 




This is the same as the first contraction contributing to QCD scattering, 
up to a sign. Opq does contribute to this correlator. Thus £pq contains 
information about the relative size of two contractions in the QCD process. 



4.4. PQXPT at LO 

With the Lagrangian in hand it is straightforward to develop perturbation 

(2) 

theory. Inserting the expansion (122) into £ P q, we find 

£ P 2 ^ = str(9 M $d M $) + str( x $ 2 ) + . . . (129) 
= tr(9 AI 0(9 M + d^id^ - d^d^rn - d^d^) 
fxv 



tr 



tr(^xv) ~ tr( mmX v)- (130) 



V Xs, 

Here xv,S are the mass matrices in the valence and sea sectors, respectively, 
multiplied by 2Bq. </> is like the pion field in XPT, except that it includes 
both valence and sea quarks. The propagator for "charged" mesons with 
flavor (which can be VV, VS or SS) is 

(p 2 +ml 2 )-\ m\ 2 = ( X i+X2)/2. (131) 

On the other hand, the terms involving the "ghost-ghost" boson <f> have 
unphysical signs. It appears that we are expanding the ghost-ghost sec- 
tor of S about the wrong point, since the potential is maximized. Fur- 
thermore, the kinetic term will not give a convergent functional integral. 
Both these problems result from our earlier decision to use the symmetry 
group G, even though it was inconsistent with convergence. A more careful 
treatment 86,80 ' 88 shows that one should have changed <f> — > i<fi, which solves 
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the convergence problem, at the cost of introducing i's into vertices. XIX In 
perturbation theory we can reshuffle the i's by hand, and work with the 
naive propagator one gets from (130). For "charged" ghost mesons with 
flavor q-Jii one has 

-{p 2 + ml 2 )-\ m 2 12 ^( X i+X2)/2. (132) 

Finally, the NG fermion propagators can have either sign. There are no 
convergence issues for fermions, but signs are important for cancellations. 

What about the "neutral" fields faqi, etc.)? Here we have to implement 
the constraint str($) = tr(0) — tr(</>) = 0. There are two ways to do 
this. The first is simply use a basis of generators which is straceless: $ = 
J2 a ®aT a with str(T a ) = 0. This is analogous to excluding the r( in XPT, 
but is more complicated in PQXPT. In the second method, we remove the 
constraint by including a singlet field, $ — > $ + ^o/v^/V, adding a mass 
term to the action, 

£ PQx - £ PQx + ml str($) 2 /7V , (133) 

and then integrating out <&o by sending m% — ► oo. This is just a trick to 
project out the singlet. To make it formally correct, we must regularize the 
theory with a cut-off so that m\ always exceeds any loop momenta. This 
is the method mostly used in practice as it is simple to implement. 
Using this method, the neutral propagator is obtained from: 

N+2N V 

£ (2) = E <M' ( hA,<h:i + m^%) + (mf5/A0(]T e^jjf + . . (134) 
j=i i 
+1 valence or sea quarks , , 

— 1 ghosts 

The mo term couples all the so that, in particular, neutral sea-quark 
states can contribute to neutral valence propagators. The inversion of the 
kernel is not trivial for general quark masses, but can be accomplished 
using linear algebra tricks 79 ' 6 . I show an example of the result for TV = 3 
non-degenerate sea quarks, after having sent m\ — > oo 



eAj 1 1 {p 2 + Xsi)(p 2 + XS2){p 2 + Xsz) 

u/ P 2 + X t N(p* + Xi){p 2 + Xj) (P 2 + M 2 )(p 2 + M 2 ) 

(136) 



XIX Obtaining a positive ghost propagator docs not imply a restoration of reflection 
positivity. This is now violated by the i's in the vertices. 
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Here M v o and M v are the masses of the sea-sector neutrals, after inclusion 
of 7T° — t] mixing evaluated at LO in XPT (as discussed in the first lecture). 
If we take i,j to be valence labels, and set Xi = Xj ( or simply consider 
i = j), we see the infamous double pole in the second term. It is reduced 
to a single pole if the valence mass equals that of one of the sea quarks, 
Xi = XSj- The form simplifies if the sea quarks are degenerate, 



($..$..)= €i5ii -- iP +XS) (137) 



The residue of the double pole for X i — Xj 1S then (xi — Xs)/N, showing 
how it vanishes in the physical subspace. Setting Xi = Xj = Xs we obtain: 

1 / 1 

P 2 + Xs \ ~N 



(<s>ss$ss) = -2-ttt: ( 1 - - ) ■ ( 138 ) 



This is the correct result in the sea sector, with the 1/N term projecting 
out the 77'. 

Introducing $0 has allowed us to use the basis $y ~ QiQj for neu- 
tral as well as charged states. This means that one can follow the fla- 
vor indices in an unambiguous way through any PQXPT diagram, re- 
sulting in "quark-line diagrams" . This is a useful qualitative tool in 
thinking about calculations. Charged particles propagators are simple: xx 

J ^ 

( 1>, = ± 



%1~*3U - - J -p^ + ( X i+Xi)/2 



1 

Typically one uses solid lines for quarks (distinguishing valence and sea by 
a label), and dashed lines for ghosts. In the diagram above I chose i, j to be 
quark labels. With this notation the neutral propagator is, schematically, 

V + h z^czdcz: h 



where the "hairpin" is the mo vertex. The first term corresponds to the 
single pole in (137), while the remaining diagrams all have a double pole 
(if Xi — Xj)i an d their summation gives the second term in (137). Note 
that the valence and ghost contributions cancel exactly between two hairpin 
vertices. 



c The sign of the propagator is to be determined from eq. (130). 
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4.5. NLO calculations in PQX.PT and outlook 

With the machinery developed above, perturbative calculations in PQXPT 
are straightforward extensions of those in standard XPT. (I stress again 
that this is not true for non-perturbative calculations like those in the 
e— regime.) Let me sketch an example, that of the mass of a pion composed 
of valence quarks VI, V2. The types of quark-line diagrams corresponding 
to the one-loop diagram are: 




Here thin (blue) lines are valence quarks, dashed lines are ghosts, and 
thick (brown) lines are sea quarks. I have used the result that the four- 

(2) 

pion vertices from C P q involve a single supertrace and so give "connected" 
quark-line vertices. The contributions of loops of valence-valence bosons 
cancel those of valence-ghost fermions, as expected from the underlying 
theory. This leaves only loops of valence-sea bosons on the second line 
(which turn out to cancel for m n but not for while the third line shows 
the hairpin contributions. 

The result (simplified by assuming degenerate sea quarks and mvi = 
my 2 = %) is 89 



1 %Xv - xs 



m vv = xv 1 + T7 ln(xv/V 2 ) 



N 



Xv ~ Xs 
NA* 



— [(2L 8 - L 5 ) X v 



(2L 6 - U)Nxs] 



(139) 



The terms proportional to 1/N arise from the loops involving hairpins, 
while the second line shows the analytic terms from the NLO Lagrangian. 
The unphysical nature of the double poles in the hairpin contribution gives 
rise to the "enhanced logarithm" , proportional to xs m (xvO, which diverges 
when my — > at fixed mg, and leads to a breakdown of PQXPT. This is 
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the divergence noted above which requires that one take the chiral limit 
with my/ms fixed. The divergence occurs only at extremely small valence 
quark masses, however, and should not be a problem in practice 6 . 

The analytic terms in (139) show the utility of partial quenching. By 
varying my and mj, and including the chiral logs of the first line in the 
fit, one can separately determine 2L$ — L 5 and 2L 6 — L 4 . This should be 
compared to the setting \v — Xs, so that one is in the physical subspace. 
Then the chiral logs are physical and not enhanced, but varying \v only 
allows a determination of 2L% — L§ + 2Lg — L4. In fact, PQ simulations have 
been used by various groups to determine 2L§ — which is of particular 
interest as its value determines whether it is possible for m u — (which 
would solve the strong CP problem). The answer is clearly negative 3 . 

4.5.1. Status of PQXPT calculations 

It is now standard to extend to PQXPT any XPT calculation relevant for 
extrapolating lattice results. Many quantities have been considered at one- 
loop, XXI including the masses and form-factors of pions, baryons, vector 
mesons, and heavy-light hadrons, structure functions of baryons, the scalar 
two-point function, weak matrix elements (P>k, K — > tttt), the neutron 
electric dipole moment, and pion and nucleon scattering amplitudes. Sim- 
ilarly standard are partially quenched extensions of calculations including 
discretization effects: tmXPT, (rooted) staggered XPT, and mixed-action 
XPT. In most cases this extension is straightforward. The cases where 
it is not involve non-trivial generalizations of normal representations to 
those of graded groups. Two notable examples are octect baryons 92 ' 93 and 
four-fermion matrix elements 94 . Particularly striking examples of the un- 
physical nature of PQQCD are the negative contributions to correlators 
that are strictly positive in QCD, e.g. the scalar two-point function 95 , and 
the appearance of non-unitary effects in two-pion correlators 96 ' 97 . Another 
interesting application of PQQCD is to the calculation of the spectrum of 
the continuum 98 and lattice 99 Dirac operators. 

The MILC studies of pion and kaon properties show the potential power 
of using partial quenching 3 , while at the same time exposing the challenges 
of extrapolating using XPT. With very precise numerical results, and masses 
ranging from m s /10 to m s , an accurate description of their data requires 



XXI I have chosen not to give references for the subsequent list — there is not space for 
the 0(100) that would be needed. 
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not only terms of NLO but also of NNLO and, at the higher masses, of 
NNNLO. The loop contributions are not known beyond NLO in staggered 
XPT, so only analytic NNLO and NNNLO terms are kept. This is clearly a 
phcnomcnological approach, but the key point to keep in mind is that these 
higher order terms have essentially no impact on the resulting extrapolated 
results for physical quantities: they are very small corrections for the phys- 
ical up and down quark masses, and the strange quark is anyway at, or 
close to, its physical value. The MILC fits would not be possible without 
the large number of partially quenched points, and their success provides a 
posteriori justification for the assumptions needed to develop PQXPT. 

The need to work beyond NLO in practice has spurred some heroic 
work from continuum XPT experts: there are now full NNLO (i.e. two- 
loop) calculations for pion and kaon properties in PQXPT 31 ! 

4.5.2. A final example: L7 

I close these lectures with a final example of which I am particularly fond 
and which nicely illustrates the power of PQQCD 6 . This concerns the LEC 
Lj, which multiplies a "two (s)trace term", 



and which contributes to PGB masses only for non-degenerate quarks. Its 
most significant contribution in QCD is to m v , and this leads to violations 
of the GMO relation: 



Obtaining the physical result for m v would be a highly non-trivial check of 
the lattice methodology, as it involves quark-disconnected diagrams with 
intermediate glue. For the same reason, this is a challenging calculation. 
Partial quenching can help in the usual way by providing more data to 
fit, but also by obviating the need to actually do the extrapolation to the 
physical rj . In particular, since L 5 and L 8 are known quite accurately, as 
are the chiral logs, r)' physics is tested, using eq. (141), by any method that 
allows a calculation of L-j. One such method is to calculate the residue of 
the double pole in a disconnected valence- valence correlator: 



4 4 PQ = ■ ■ ^str ( X £t - S X t)2 + . . . 



(140) 



4m 2 K -ml-3m 2 = 



Z2{m 2 K -ml) 2 
3/ 2 



(L 5 — 6L% — I2L7) + known chiral logs . 

(141) 



/ d 3 x(^ VhV1 (t,x)^ V2iV 2(0)} 
J d 3 x($ vl . V 2(t,x)<P V2 ,vi(0)} 



m vl =m v2 



t— >oo 



2M VV 



Vt 



(142) 
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With N = 3 degenerate sea quarks one finds 
v = Xv^Xs -™(l 7 + |0 (xv - Xsf + known chiral logs , (143) 

so Lj can be determined from the term quadratic (and thus even) in the 
deviation from the unquenched theory. The generalization of this formula 
to a 2 + 1 theory has not been worked out, but should be straightforward. 

I like this formula as it gets to the essence of partial quenching: using un- 
physical phenomena (in this case the double pole) to obtain physical results 
(here L7). Of course, we have not removed the need to do the challenging 
calculation of quark-disconnected correlators, but rather have packaged the 
calculation in a way that is more flexible because there are more knobs to 
turn. The result has been very recently extended to NNLO 100 . 
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